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ABSTRACT 


In  solving  any  problem  on  a  digital  computer  one  must  st'^e 
within  the  computer  memory  certain  pieces  of  information,  or  data, 
upon  which  the  program  implementing  the  solution  process  makes  its 
decisions  or  performs  its  calculations.  For  virtually  all  problems 
the  manner  in  which  the  relationships  of  accessibility  among  the 
various  items  are  portrayed  by  the  organization  of  these  items  in  the 
computer  memory  can  have  a  marked  effect  upon  the  efficiency  of 
the  solution  process.  The  objective  of  the  research  reported  here 
is  the  development  of  a  rigorous  quantitative  method  for  the  automatic 
design  of  optimal  computer  memory  representations  for  data. 

To  this  end,  we  define  a  relational  model  of  data  structure 
within  which  we  can  specify  the  logical  ordering  or  structure  of  the 
data  involved  in  the  solution  of  a  given  problem.  We  then  develop  a 
decision  model  for  the  specification  of  the  storage  structures  (i.e. , 
the  computer  memory  representations)  which  can  represent  an 
arbitrary  data  structure. 


We  define  two  basic  measures  of  performance  -  a  time  cost 
function  and  a  storage  cost  function  -  for  use  in  comparing  the  rela¬ 
tive  merits  of  a  collection  of  storage  structures.  The  time  cost 
function  reflects  the  number  of  time  units  required  to  perform  cer¬ 
tain  of  a  set  of  primitive  operations  using  a  particular  storage 
structure,  and  the  storage  cost  function  reflects  the  total  number  of 
storage  units  occupied  by  the  storage  structure. 

Finally,  we  present  a  procedure  which  determines  of  the 
storage  structures  which  can  feasibly  represent  a  given  data  structure 
the  storage  structure  for  which  the  time  and  storage  costs  satisfy  cer¬ 
tain  optimality  conditions.  A  storage  structure  is  considered  to  be 
optimal  if  it  minimizes  the  time  cost  function,  subject  to  an  upper 
limit  on  its  storage  cost. 

To  demonstrate  the  feasibility  and  the  effectiveness  of  the  fech- 
niques  we  develop,  we  apply  them  to  a  problem  for  which  a  solution 
program  already  exists  and  for  which  fairly  extensive  data  about  sys¬ 
tem  performance  are  available.  Our  results  demonstrate  conclusively 
that  significant  improvement  in  pregram  efficiency  can  be  obtained  by 
applying  these  techniques,  as  opposed  to  the  historically  intuitive  and 
qualitative  methods  used  for  storage  structure  design. 
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Chapter  I 


INTPOB  UC  TION 

The  major  thrust  of  the  research  reported  here  is  the  develop¬ 
ment  of  a  procedure  for  the  automatic  design  of  optimal  schemes  for 
the  representation  of  data  within  a  computer  memory. 

In  solving  any  problem  on  a  digital  computer  one  must  store  in  the 
computer  memory  certain  pieces  of  information  upon  which  the  program 
implemenling  the  solution  process  makes  its  decisions  or  performs  its 
calculations,,  For  most  problems  involving  more  than  a  few  items  of 
information  the  manner  in  which  the  relationships  of  accessibility  among 
the  various  items  are  portrayed  by  the  organization  of  these  items  in  the 
computer  memory  can  have  a  very  marked  effect  upon  the  efficiency  of 
the  solution  process.  (As  an  extreme,  imagine  an  hypothetical  organi¬ 
zation  for  which  the  items  most  frequently  used  are  least  accessible, 

versus  an  organization  for  which  these  items  are  most  accessible.) 

Clearly,  we  should  like  to  organize  the  items  of  information  in  the 

computer  memory  in  such  a  way  as  to  minimize  the  overall  cost  of 
accessing  them.  On  the  other  hand,  we  may  also  wisn  to  minimize  or  to 
limit  the  amount  of  memory  devoted  to  the  representation  of  these  items. 
Unfortunately,  those  two  goals  are  often  in  conflict.  We  may  be  able 
to  minimize  the  cost  of  access  at  the  expense  of  the  memory  required 
to  represent  the  items  of  information  and  vice  versa,  but  seldom  can 
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we  easily  do  both. 

This  problem  is  further  complicated  by  a  lack  of  rigorous,  objec¬ 
tive  techniques  for  the  evaluation  and  comparison  of  alternative  organi¬ 
zational  schemes,  let  alone  any  such  techniques  for  their  design. 

For  the  most  part  the  design  of  a  scheme  for  the  computer  repre¬ 
sentation  of  data  (i.e. ,  the  items  of  information)  associated  with  the 
solution  of  a  given  problem  is  an  art  fraught  with  the  subjectivity  and 
the  personal  prejudices  of  the  practitioner,,  The  designer  calls  upon 
his  experiences  with  previous  systems,  combines  this  information  with 
a  good  portion  of  intuitioq  and  Voila  !  comes  up  with  the  perfect  solu¬ 
tion! 

Indeed,  this  technique  may  produce  a  good  solution  to  the  problem 
at  hand.  Li  fact,  if  the  designer  is  alert,  the  solution  is  probably 
better  than  the  other  alternatives  which  he  may  have  considered. 

He  still  has  no  assurance,  however,  that  there  is  no  better  solution  to 
the  problem.  If  the  system  in  which  his  organizational  scheme  is  to 
be  imbedded  will  see  only  limited  use,  the  designer  may  not  care. 
Otherwise,  he  may  console  himself  with  the  fact  that  he  has  done  the 
best  he  can. 

Perhaps  we  have  overstated  our  case.  The  fact  remains,  however, 
that  ihe  designers  of  data  representation  systems  have,  a  best,  prim¬ 
itive  tools  with  which  to  work. 
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Our  objective  in  this  research  has  been  to  develop  a  rigorous 
framework  within  which  we  can  discuss  the  logical  ordering  or  structure 
(or  accessibility)  of  the  items  of  information  associated  with  a  given 
problem,  certain  measures  of  performance  with  which  we  can  com¬ 
pare  objectively  the  relative  merits  of  alternative  computer  represen¬ 
tations  for  these  items,  and  finally  a  procedure  for  choosing  from  the 
possible  alternative  representations  that  representation  which  best 
satisfies  certain  given  optimality  conditions. 

The  various  sections  of  this  chapter  will  be  devoted  to  presenting 
some  pertinent  historical  background,  defining  terms,  and  describing 
in  a  qualitative  manner  our  approach  to  the  problem. 


1.1  A  Brief  history  of  Computer  Data  Representation 


Due  to  the  fact  that  computer  memories  have  been  organized  in  <t 
linear,  sequential  manner  since  the  earliest  days  of  the  stored-pro- 
gram  computer,  it  is  not  surprising  to  find  that  all  manner  of  data  has 
been  shaped  and  bent,  as  it  were,  to  fit  within  vectors  and  rectangular 
arrays  kept  in  consecutive  memory  locations. 

On  the  other  hand,  neither  is  it  surprising  to  find  those  who  would 
rebel  against  the  structure  imposed  upon  their  data  by  the  inherently 
sequential  nature  of  computer  memories. 

The  first  really  significant  break  with  the  vector  of  consecutive 
locations  was  made  in  1956  when  Newell,  Shaw,  and  Simon  [59]  intro¬ 
duced  the  concept  of  linked  allocation  and  the  pointer  in  a  memory 
organization  scheme  imbedded  in  a  list  processing  language  (IPL) 
designed  for  heuristic  problem-solving. 

For  their  purpose c  Newell,  Shaw,  and  Simon  defined  a  list  to  consist 
of  an  ordered  set  of  items  of  information  any  item  of  which  could  be 
another  list  or  an  element,  where  an  element  was  some  basic  unit  of 
information  constrained  to  fit  within  a  single  word  (i.e. ,  location) 
of  memory. 

A  list  was  implemented  by  using  a  set  of  location  words,  each  of 
which  was  subdivided  into  three  fields,  two  fields  containing  addresses 
and  one  field  containing  a  type  code.  One  address  located  the  item 
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corresponding  tc  the  given  location  word  of  the  list  and  the  other  add¬ 
ress  located  the  next  location  word  of  the  list.  The  type  code  in  each 
location  word  simply  indicated  whether  the  item  associated  with  that 
location  word  was  a  list  (0)  or  an  element  (1). 

Thus,  if  we  were  tc  define  two  lists  A  and  B  as 

A  =  (a,B,e) 

B  =(b,c,d) 

where  a,b,c,d,and  e  are  elements,  then  the  Newell,  Shaw,  and  Simon 
list  structure  representing  these  two  lists  would  appear  as  in  Figure 
1-1,  where  an  arrow  -  called  a  link  or  a  pointer  -  originating  from  a 
given  block  (representing  a  location  word)  represents  the  address  of 
the  block,  or  word,  to  which  it  points. 

The  real  innovation  of  this  data  organization  lies  in  the  fact  that 
the  addresses  oi  the  various  items  in  a  list  need  bear  no  particular 
relation  to  one  another;  that  is,  the  addresses  need  not  be  consecutive. 
This  has  several  implications.  First,  an  item  can  be  deleted  from  a 
list  simply  by  deleting  its  location  word.  Inserting  an  item  into  a  list 
is  also  facilitated.  Secondly,  location  words  on  different  lists  (or  the 
same  one,  for  that  matter}  may  contain  the  address  of  the  same  item  of 
information.  Finally,  two  or  more  lists  may  share  the  same  region 
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of  memory  with  no  list  overflowing  until  all  memory  has  been 
exhausted. 

The  most  obvious  disadvantages  of  the  scheme  are  that  the 
location  words  require  extra  memory  space,  with  the  result  that 
approximately  half  of  the  storage  used  is  taken  up  by  location 
words,  and  that  the  ability  to  computer  the  address  of  the  next  item 
on  a  list  or  to  determine  the  address  of  the  previous  item  on  the 
list  is  lost. 

The  work  performed  by  Newell,  Shaw,  and  Simon  inspired  .many 
others  to  use  the  linked  memory  scheme  (which  at  the  time  was  often 
called  NSS  memory),  and  these  techniques  gradually  evolved  as  basic 
programming  tools.  The  first  article  dealing  specifically  with 
the  application  of  linked  data  organizations  to  garden-variety 
problems  was  published  by  Carr  [  8  ]  in  1959.  In  this  article 
Carr  indicated  that  linked  lists  can  readily  be  manipulated 
in  ordinary  programming  languages  without  the  facade  or 
restrictions  of  sophisticated  list  languages .  Despite  this 
observation,  however,  list  processing  languages  continued  to 
evolve,  each  with  its  own  specific  memory  organization 

scheme. 

At  first  one-word  nodes  or  elements  were  used  for  the  linked 
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memory  schemes,  but  about  1959  the  usefulness  of  several  consecutive 
words  per  node  and  lists  structured  via  multiple  links  was  being  dis¬ 
covered.  The  first  article  dealing  specifically  with  this  idea  was 
published  by  Eoss  [65]  in  1961.  and  a  second  article  dealing  with 
multiword  list  items  was  published  by  Comfort  [14]  in  1964. 

The  growth  in  popularity  of  linked  lists  gradually  produced  such 
refinements  as  the  circular  list,  or  ring,  in  which  the  last  item  of 
the  list  contains  a  pointer  to  the  first  item,  and  the  doubly  linked  list, 
in  which  each  item  of  the  list  contains  a  pointer  to  the  previous  item 
as  well  as  the  succeeding  item.  The  origins  of  these  concepts  cannot 
be  attributed  to  specific  individuals,  probably  because  these  ideas 
occurred  naturally  to  many  people,.  On  the  other  hand,  one  of  the  main 
factors  which  lead  to  the  widespread  use  of  these  techniques  was  the 
introduction  of  certain  list-processing  langua^^s  and  systems  which 
utilized  them,  in  particular  Weizenbaum's  Symmetric  List  Processor 
(SLIP)  [78, 'i 9]  . 

Another  novel  concept,  first  introduced  in  1960  by  Perlis  and 
Thornton,  was  that  of  the  threaded  list.  In  this  scheme  each  item  of 
a  list  contained  three  fields,  one  of  which  was  a  '  de  indicating  how 
the  other  two  fields  were  to  be  inierpreted.  In  particular,  one  of 
these  fields  contained  either  an  item  of  data  or  a  pointer  to  a  sublist 
(that  is,  a  list  which  functioned  as  an  element)  and  the  other  field 
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containe  a  pointer  to  the  next  item  on  the  list  or,  if  the  item  were 
the  last  on  the  list,  a  pointer  to  the  item  for  which  this  list  served 
as  a  sublist.  To  avoid  treating  as  a  special  case  the  last  item  erf 
any  list  which  was  not  actually  a  sublist  of  any  other  list,  a  so-called 
head  was  introduced,  which  used  the  list  in  question  as  a  sublist. 
Thus,  the  list 


(a,  (b,c),  d,  e) 

where  (b,r)  acts  as  a  sublist,  would  have  the  threaded  representation 
shown  in  Figure  1-2. 

Although  this  scheme  permits  rapid  sequencing  through  levels  of 
sublists  without  having  to  "remember"  when  a  sublist  has  been  en¬ 
countered,  it  suffers  the  disadvantage  that  sublists  may  not  be  shared. 

The  fact  is  that  circular  lists ,  doubly  linked  lists ,  and  threaded 
lists  are  all  designed  to  facilitate  searching  or  sequencing  through  a 
list  and  at  the  same  time  to  maintain  the  ease  of  inserting  and  deleting 
items.  Although  additional  pointers  can  increase  somewhat  the  com¬ 
plexity  of  the  procedures  used  for  modification  and  do  require  addi¬ 
tional  storage  above  the  corresponding  simpler  linked  list,  these 
disadvantages  are  often  more  than  offset  by  increased  ease  of 
searching. 

As  wo  have  already  indicated,  one  of  the  main  factors  resitting 
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in  the  exposure  of  programmers  to  list-processing  techniques  was 
the  development  and  availability  by  the  mid-1 960's  of  a  number  of 
list  processing  languages  and  systems.  The  first  widely  used  system 
of  this  sort  was  IPL-V  [  60]  ,  a  relatively  low-level  interpretive 
system  descended  from  Newell,  Shav,  and  Simon's  IPL.  Also  fash¬ 
ioned  after  IPL  was  a  system  called  FLPL  [23  ]  ,  a  set  of  FORTRAN 
subroutines  for  list  manipulation  developed  by  Gelemter.  A  third 
system  called  LISP  [54]  ,  developed  by  McCarthy,  involved  list 
concepts  similar  to  the  first  two  but  was  implemented  in  a  somewhat 
different  manner.  The  most  recent  system  of  that  period  was 
Weizenbaum's  SLIP  [79]  . 

As  important  as  these  systems  and  all  others  of  their  type  were 
(  and  siill  arel),  they  all  suffer  from  one  major  flaw,  the  same  flaw 
which  characterized  the  vector  of  consecutive  locations  a  vhich  ir. 
fact  lead  to  their  development:  inflexibility.  Each  of  these  systems 
provides  a  fixed  type  of  structure  to  which  the  data  to  be  represented 
must  be  tailored.  Ideally  the  memory  representation  should  conform 
to  the  data  and  the  application  at  hand. 

In  1966,  however,  Knowlton  introduced  a  general.,  relatively  low- 
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level,  list  processing  system  called  L  [41,42]  ,  the  features  of  which 
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were  quickly  incorporated  into  a  number  of  other  systems.  L  allows 
the  programmer  to  choose  blocks  of  sizes  to  his  liking,  to  define  fields 


within  these  blocks  as  he  sees  fit,  and  in  general  to  manipulate  his 
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structures  as  he  desires.  While  not  a  panacea,  L  is  certainly  a  step 
in  the  right  direction. 

Dissatifaction  with  certain  aspects  of  linked  memory  schemes  (in 
addition  to  dissatisfaction  with  the  sequential  schemes)  when  applied  to 
certain  types  of  problems  caused  some  individuals  to  seek  still  further 
alternatives.  In  particular,  Feldman  [20]  and  Kovner  [67]  were  con¬ 
cerned  with  problems  in  the  area  of  artificial  intelligence  for  which  an 
associative  memory,  in  which  reference  to  information  stored  in  the 
memory  is  made  by  specifying  the  contents  of  a  part  of  a  cell  (word) 
instead  of  an  address,  would  be  decidedly  advantageous.  Unfortunately, 
the  cost  of  a  hardware  memory  of  this  type  and  of  the  size  required  for 
such  problems  was  prohibitive.  Therefore,  in  1965  Feldman  proposed 
his  version  of  a  software -simulated  associative  memory  for  a  computer 
with  conventional  memory  and  introduced  the  concept  of  hash-coding  as 
a  form  of  address  determination. 

The  basic  idea  behind  hash-code  addressing  is  that  given  the  contents 
of  some  part  of  a  memory  cell  (where  a  cell  may  consist  of  more  than 
one  word),  the  address  of  that  cell  can  be  determined  by  applying  some 
transformation  to  the  contents  given. 

In  Feldman’s  scheme  each  cell  contains  three  principal  fields 
which  we  will  designate  F^,  Fg,  and  F^,  the  contents  of  which  are 


12 


presumably  related  to  one  another  in  some  fashion.  A  triple  con¬ 
sisting  of  a  value  for  each  of  the  fields  Fp  Fg,  and  Fg  is  assigned 
an  address  in  the  memory  by  "hash-coding"  the  values  for  fields 
Fj  and  Fg.  In  Feldman’s  case  the  hash-coding  is  implemented  by 
shifting  the  value  for  field  F^  left  a  number  of  places,  performing  a 
partial  add  between  this  result  and  the  value  for  field  F9,  and  insuring 
that  the  result  is  even. 

Retrieving  the  value  of  field  Fg  given  the  values  of  fields  F^  and 
Fg  then  simply  involves  hashing  the  values  of  F^  and  Fg  to  determine 
the  address  of  the  cell  containing  the  value  sought. 

Obviously,  there  are  a  number  of  problems  which  can  arise 
in  using  such  a  scheme.  In  the  first  place  there  rr  ty  be  more  than 
one  value  for  Fg  associated  with  a  given  pair  of  values  for  F^  and  Fg. 
This  situation  is  called  multiplicity.  Secondly,  it  is  possible  that 
two  distinct  pairs  of  values  for  F^  and  Fg  may  hash  to  the  same 
address.  This  situation  is  called  overlap.  Finally,  the  cell  accessed 
by  hashing  a  particular  pair  of  values  for  F^  and  Fg  may  have  been 
used  as  a free-storage  cell  to  handle  the  overlap  or  r  ultiplicity  of 
another  cell.  This  situation  is  called  conflict  and  requires  moving 
the  present  contents  of  that  cell,  a  very  costly  operation. 

Feldman  handles  the  overlap  and  multiplicity  problems  -  and 
provides  for  such  operations  as  determining  the  values  of  F^  and  Fg 
given  the  value  of  Fg  -  by  including  certain  link  fields  in  each  cell. 
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About  a  year  after  Feldman  introduced  his  version  of  an  asso¬ 
ciative  processor,  Rovner  proposed  certain  extensions  to  the  scheme 
to  make  it  feasible  for  such  a  system  to  be  used  in  the  paged  environ¬ 
ment  of  a  virtual  memory  machine.  The  basic  concepts,  however, 
remained  unchanged. 

From  this  point  on  in  time  the  number  of  memory  representations 
has  grown  very  rapidly.  The  principles  involved  in  each  of  Lhese  ’’new" 
representations  are,  however,  basically  no  different  from  those  we 
have  presented  so  far. 

In  a  recently  published  text,  Knuth  [43  ]  presents  the  basic  principles 
of  memory  representations  in  a  very  readable  form.  In  this  reference, 
the  various  fundamental  memory  representations  are  divorced  from 
all  languages  and  systems  in  which  they  have  been  previously  imbedded 
and  are  considered  on  their  own  merits.  For  instance,  rather  than 
discussing  how  a  given  situation  might  be  handled  using  SLIP,  Knuth 
considers  how  doubly  linked  lists  (basically  the  structures  generated  by 
SLIP)  might  be  utilized. 

In  this  section  we  have  attempted  to  present  some  of  the  highlights 
in  the  evolution  of  computer  memory  data  representations.  The  reader 
should  not  expect  this  to  be  an  exhaustive  consideration  of  all  existing 
organizational  schemes,  for  it  surely  is  not.  Neither  should  the  reader 
expect  this  to  b .  a  tutorial  on  the  basic  principles  of  computer  memory 
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representations.  For  such  a  discussion  the  reader  is  referred  to  the 
text  by  Knuth.  This  section  is  intended  primarily  to  give  the  reader 
a  flavor  for  the  problems  and  considerations  involved  in  the  design  of 
data  representations  for  the  eorrputer  memory. 

1 .  2  Definitions  and  Philosophy 

Up  to  this  point  our  discussions,  out  of  necessity,  have  been 
of  a  rather  intuitive  and  subjective  nature.  We  have  :sed  sucli  impre¬ 
cise  terms  as  "computer  memory  representation"  and  "data  organi¬ 
zation  scheme"  in  an  attempt  to  describe  the  problem  with  which  we  are 
concerned.  In  this  section  we  intend  to  define  a  number  of  terms  sc 
that  our  future  discussions  may  assume  a  less  ambiguous  posture. 

We  should  note  at  the  outset  that  the  individuals  (and  the  corre¬ 
sponding  literature)  concerned  with  computer  memory  data  organi¬ 
zations  may  genei  ally  be  grouped  into  tvo  rather  ill-defined  and 
overlapping  classes.  On  the  one  hand  are  those  individuals  who  are 
primarily  concerned  with  the  representation  of  data  within  the  main 
store  (e.g. ,  core  memory)  of  a  computer,  and  on  the  other  hand  are 
those  who  are  primarily  concerned  with  the  organization  of  data  within 
the  secondary  store  (e.g. ,  drum  and  disc)  of  a  computer  or  within 
a  hierarchy  of  memory  devices.  The  former  might  be  called  "data 
structure*  enthusiasts"  and  the  latter,  "file  management  enthusiasts". 

* 

The  term  "data  structure"  is  used  here  in  a  very  broad,  generic 
sense  which  differs  from  our  later  definition  of  the  term. 
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Fundamentally,  the  problems  which  face  these  two  groups  are 
identical.  There  are  differences,  however,  in  the  respective  en¬ 
vironments  in  which  their  solutions  to  these  problems  must  operate. 

The  inherently  sequential  nature  of  accessing  information  on  a  disc 
may  dictate  a  different  organizational  scheme  than  the  random  access 
nature  of  core  memory. 

We  do  not  wish  to  dwell  upon  these  differences  (for  the  similarities 
may  more  than  outweigh  them)  but  merely  wish  to  indicate  that  there 
are  frequently  (but  not  always)  differences  in  the  terminology  used  by  the 
two  groups.  For  example,  whereas  the  data  structure  people  are 
genera  ly  concerned  with  the  interrelationships  among  data  items 
or  data  elements,  the  file  management  people  are  generally  concerned 
wuh  the  interrelationships  among  files  and  records. 

For  the  purposes  of  this  exposition  we  will  reside  in  the  camp 
of  the  data  structure  people. 

Let  us  now  proceed  with  the  definition  of  some  terms.  In  solving 
a  particular  problem  cn  a  digital  computer,  the  program  which  imple¬ 
ments  the  solution  process  operates  upon  some  collection  of  objects 
which  are  used  as  the  basis  for  decision  or  calculation.  Each  of  these 
objects  represents  an  occurrence,  or  an  instance,  of  some  physical 
or  conceptual  quantity  and  is  characterized  by  an  ordered  pair  con¬ 
sisting  of  a  data  name  and  a  data  value.  The  data  name  indicates  the 
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quantity  itself,  such  as  planet,  river,  city,  etc. ,  and  the  data  value 
indicates  the  particular  instance  of  this  quantity. 

The  data  name/data  value  ordered  pair  (or,  equivalently,  the 
object  characterized  by  the  ordered  pair)  will  be  called  a  data  item. 

A  data  item  is  denoted  by  its  corresponding  data  name,  and  an 
instance  of  a  data  item  is  a  specific  data  name/  data  value  pair.  For 
example,  the  ordered  pair  (CITY,  ANN  ARBOR)  is  an  instance  of 
the  data  item  CITY . 

The  definitions  of  data  name,  data  value,  and  data  item  which  we 
have  just  presented  are  essentially  the  same  as  the  definitions  used 
by  McCuskey[55]  ,  who  points  out  that  in  common  high-level  program¬ 
ming  language  usage  the  data  value  corresponds  to  the  "data"  which 
is  stored  in  the  computer  memory  while  the  data  name  corresponds 
to  "data  about  data"  which  appears  in  the  source  program  and  enters 
a  symbol  table  during  compilation. 

The  data  name  and  the  data  value  are  represented  in  the  computer 
memory  by  sequences  of  elementary  objects  called  symbols  which  are 
members  of  some  alphabet,  or  set  of  all  symbols,  and  which  are 
known  to  the  program.  One  possible  alphabet  is  the  EBCDIC  character 
set. 

The  ordered  pair  of  symbol  sequences  which  represent  a  data  name/ 
data  value  pair  is  called  a  data  item  description,  or  simply  a  description. 
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It  should  be  clear  that  a  given  data  item  may  have  any  of  a  number 
of  descriptions  depending  upon  the  alphabet  chosen. 

Let  us  now  examine  the  actual  steps  involved  in  solving  a  problem 
on  a  digital  computer. 

First,  of  course,  we  must  define  the  problem  to  be  solved.  This 
involves  specifying  such  things  as  the  information  (i.e. ,  the  data 
items)  which  the  solution  process  is  to  be  given  initially  and  upon  which 
it  is  to  operate,  the  algorithms  (perhaps  in  the  form  of  flow  charts) 
which  are  pertinent  to  the  solution  process,  and  finally  the  results 
which  the  solution  process  is  expected  to  determine.  These  specifi¬ 
cations  must  be  complete  and  concise  and  must  reflect  the  require¬ 
ments  of  any  solution  of  the  problem,  manual  or  automatic. 

Second,  we  must  specify  the  logical  ordering  or  structure  of  the 
various  data  items  -  that  is,  the  logical  relationships  of  accessibility 
among  the  various  data  items  -  and  the  logical  access  processes 
which  may  be  used  to  find  any  data  item  in  the  structure.  The  result 
of  this  step  is  a  specification  of  the  data  structure  of  the  data  items 
involved  in  the  solution  of  the  problem. 

Tiiird,  given  the  specification  of  the  data  structure  for  the  problem, 
we  must  determine  a  suitable  physical  organization  for  the  data  items 
within  the  computer  memory.  The  resulting  representation  is  called 
the  storage  structure  of  the  data. 
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Finally,  we  must  actually  generate  the  code  for  the  program 
required  to  implement  the  solution  process  and  the  specified  storage 
structure. 

For  the  purpose  of  this  exposition  we  will  concern  ourselves 
primarily  with  the  second  and  third  of  these  steps.  We  assume  that 
we  are  given  the  problem  definition  as  an  initial  starting  point  and 
that  we  produce  the  specifications  used  by  the  system  implementor. 

The  reader  should  note  the  distinction  we  have  made  between  the 
terms  "data  structure"  and  "storage  structure".  This  distinction 
was  first  made  by  D'Imperio  [17]  in  1964  and  later  by  Mealy  [57]  in 
1967.  The  reader  is  cautioned,  however,  that  not  all  authors  use 
these  terms  in  the  same  way.  In  fact,  it  is  frequently  the  case  that 
the  term  data  structure  is  used  to  encompass  the  spectrum  of  both 
data  structure  and  storage  structure  as  we  have  defined  them. 

In  the  context  of  our  definitions,  data  structures  arise  from  the 
interpretations  we  give  to  certain  aspects  of  a  problem  and  its  solution, 
and  they  are  not  necessarily  invariant,  inherent,  or  necessary 
characteristics  of  the  data  items  themselves. 

To  contrast  the  concepts  of  data  structure  and  storage  structure 
once  again  we  might  say  that  data  structures  are  simply  theories  of  the 
structure  of  the  real  world,  and  storage  structures  are  computer 
representations  of  these  theories. 
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It  is  patently  clear  that  even  for  the  simplest  of  data  structures 
there  are  a  multitude  of  storage  structures  which  may  be  used  to 
represent  them.  Consider  for  instance  a  collection  of  n  data  items 
dp  whf  re  i  e  {1, 2,  •  •  •  ,n},  for  which  the  corresponding  data  structure 
is  linear  and  sequential  in  nature,  such  that  data  item  d^  is  the  first 
in  the  sequence,  data  item  dn  is  the  last  in  the  sequence,  and  data 
item  d.  is  oreceeded  by  data  item  d.  „  and  followed  by  data  item  d.  , . 
Some  of  the  more  obvious  choices  of  storage  structures  which  may 
be  used  to  represent  this  data  structure  include  1)  a  vector  of  con¬ 
secutive  memory  locations,  2)  a  linked  linear  list,  and  3)  a  doubly 
linked  list. 

Herein,  of  course,  lies  our  problem:  determine  the  storage 
structure  which  best  represents  a  given  data  structure.  Before 
considering  our  approach  to  the  problem,  however,  let  us  make  some 
general  observations  concerning  storage  structures. 

Ail  storage  structures  may  be  classified  according  to  one  or  more 
of  three  basic  types  of  organization:  sequential,  list,  and  random. 
These  types,  to  which  we  alluded  in  the  previous  section,  are  described 
by  Dodd*  [19]  ,  so  our  consideration  of  them  will  be  relatively  brief. 

In  sequential  organization  data  items  are  stored  in  the  computer 
memory  in  locations  relative  to  other  data  Hems  according  to  a 

* 

Note  that  Dodd’s  treatment  of  the  subject  is  from  the  viewpoint  of  file 
management  and  his  definition  of  data  structure  corresponds  to  our 
definition  of  storage  s  ructure. 
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specified  sequence.  Most  commonly,  the  data  i+cms  are  stored  in 
consecutive  locations  of  the  memory.  We  could,  however,  use  any 
other  well  de'ined  sequence.  For  instance,  the  data  items  could  be 
stored  in  let  ..  lions  whose  addresses  are  given  by  increasing  relatively 
prime  numbers  (although  the  utility  of  such  a  scheme  might  be 
questionable). 

The  basic  concept  of  list  organization  is  that  pointers  are  used 
to  separate  the  logical  ordering  of  the  memory  locations  containing 
data  items  from  the  physical  ordering  of  these  locations.  In  general, 

a  pointer  may  be  anything  which  allows  the  accessing  mechanism 
to  locate  tne  memory  cell  containing  a  given  data  item,  but  almost 

without  exception  a  pointer  is  considered  to  be  a  value  representing 
the  address  of  the  cell  containing  the  data  item. 

Finally,  in  random  organization  data  items  are  stored  and  retrieved 
on  the  basis  of  some  predictable  relationship  between  the  data  item  and 
the  address  of  its  assigned  memory  location.  We  may  distinguish  three 
basic  types  within  random  organization:  direct  address,  dictionary 
look-up,  and  calculation. 

For  the  direct  address  method  of  random  organization  an  arbitrary 
absolute  address  is  assigned  to  a  data  item  by  the  programmer  and  this 
address  is  used  every  time  the  data  item  is  to  be  accessed. 

The  dictionary  look-up  method  involves  maintaining  a  dictionary 

or  symbol  table  containing  pairs  of  keys  and  addresses  for  the  data 

items  of  interest.  To  determine  the  location  of  a  given  data  item,  the 
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dictionary  is  searched  for  the  key  (perhaps  the  data  name)  associated 
with  the  data  item;  the  address  associated  with  this  key  indicate*,  the 
location  of  the  desired  data  item. 

Lastly,  the  calculation  method  converts  the  key  of  a  given  data 
item  into  an  address  (not  necessarily  unique)  by  performing  a  standard 
calculation  or  transformation  upon  it.  This  method  includes  the  hash¬ 
coding  scheme  we  discussed  in  the  previous  section. 

We  wish  to  reiterate  the  fact  that  most  storage  structure  schemes 
are  not  purely  sequential,  list,  or  random  in  nature,  but  utilize  the 
properties  of  all  of  these  schemes  to  lesser  or  greater  extents. 

In  later  discourse  we  will  desire  to  make  frequent  reference  to  a 
particular  class  of  sequentially  organized  storage  structures  - 
namely,  those  storage  structures  for  which  the  data  items  are  stored 
in  consecutive  cells  of  memory.  To  facilitate  our  ciscussions,  let  us 
define  such  storage  structures  to  be  stacks* .  To  be  more  specific, 
let  a  stack  be  defined  to  consist  of  a  vector,  or  block,  of  consecutive 
memory  locations  which  contains  a  number  of  data  items  ordered  in 
the  same  manner  as  the  memory  locations  which  contain  them. 

Let  us  now  conclude  our  discussion  of  the  philosophy  of  data  re¬ 
presentation  by  summarizing  its  two  most  important  points.  First, 
the  concepts  of  data  structure  and  storage  structure  are  distinct. 

$ 

Note  that  this  definition  of  the  term  stack  is  not  necessarily  a  stand¬ 
ard  definition  and  differs,  in  particular,  from  the  definition  used  by 
Knuth  [  43  ]  . 
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Data  structure  refers  to  the  logical  ordering  or  structure  of  data  as 
we  interpret  it  for  the  solution  of  a  given  problem.  Storage  structure 
refers  to  the  physical  representation  of  data  structure  within  a  com¬ 
puter  memory.  Second,  a  given  data  structure  can  invariably  be 
represented  by  a  number  of  distinct  storage  structures 

1.3  Research  Objectives 

It  is  evident  that  determination  of  a  storage  structure  to  repre¬ 
sent  the  data  structure  associated  with  the  solution  of  a  given  problem 
is  a  process  involving  a  large  number  of  tradeoffs.  By  now  the 
qualitative  implications  of  these  tradeoffs  are  relatively  well  under¬ 
stood.  What  is  needed,  however,  are  rigorous,  objective  techniques 
for  evaluation  of  the  quantitative  aspects  of  the  tradeoffs.  The  result 
of  developing  such  techniques  should  be  more  intelligent  systems 
design,  leading  in  turn  to  programs  operating  with  higher  productivity 
at  a  lower  cost. 

Our  approach  to  development  of  these  techniques  is  threefold. 
First,  we  will  develop  a  rigorous  framework  in  the  form  of  a  relational 
and  set-theoretic  model  for  the  description  of  data  structure.  The 
relational  view  of  data,  which  has  been  advocated  by  others  (in  parti¬ 
cular,  Childs  [11]  ,  Codd  [12]  ,  McCuskey[55]  ,  and  Mealy  [57] ) 
appears  to  fce  superior  in  several  respects  to  a  graph  or  network 
model  since  it  does  not  superimpose  any  structure  upon  the  data  which 
might  fall  within  the  realm  of  storage  structure. 
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Second,  we  will  develop  a  decision  model  for  specifying  the 
storage  structures  capable  of  representing  any  data  structure  as  given 
by  our  data  structure  model. 

Finally,  we  will  develop  certain  measures  of  performance  which 
enable  us  to  compare  the  time  and  storage  characteristics  of  the 
storage  structures  described  by  our  storage  structure  model.  We  will 
also  present  a  procedure  for  examining  the  measures  of  performance 
for  each  of  the  set  of  storage  structures  which  can  feasibly  represent 
a  given  data  structure  and  for  determining  that  storage  structure  for 
which  these  measures  of  performance  best  satisfy  certain  optimality 
conditions. 
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Chapter  n 


A  MATHEMATICAL  MODEL  OF  DATA  STRUCTURE 

In  order  to  address  ourselves  to  the  problem  of  choosing  an 
optimal  storage  structure  for  a  particular  collection  of  data,  we 
need  a  rigorous  framework  within  which  we  can  discuss  the 
structure  of  data. 

The  subject  of  this  chapter  then  is  the  specification  of  a 
mathematical  model,  or  abstraction,  of  data  structure. 


2. 1  Relations 


Since,  as  we  have  indicated,  our  model  of  data  structure  will  be 
a  relational  model,  we  devote  this  section  to  defining  the  concept  of 
a  relation. 

A  propositional  function  defined  on  the  Cartesian  product  Ax  B 
of  two  sets  A  and  B  is  an  expression  denoted  by  P(x,y)  which- has  the 
property  that  P(a,b),  where  a  and  b  are  sustituted  for  the  variables 
x  and  y  in  P(x,y),  is  true  or  false  for  any  ordered  pair  (a,b)  e  AxB. 

For  example,  if  A  is  the  set  of  all  composers  and  B  is  the  set 
of  all  musical  compositions,  then 

P(x,y)  =  Mx  composed  by  y" 
is  a  propositional  function  on  AxB.  In  particular, 

P(Berlioz,  Symphonic  Fantastique) 

=  "Berlioz  composed  Symphonie  Fantastique" 


and 


P(Bach,  1812  Overture) 

=  "Bach  composed  1812  Overture" 

are  true  and  false,  respectively. 

The  expression  P(x,y)  by  itself  is  called  an  open  sentence  in 
two  variables  or,  simply,  an  open  sentence. 
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A  relation  r  may  be  defined  to  consist  of  the  following: 


(1)  a  set  A 

(2)  a  set  B 

(3)  an  open  sentence  P(x,y)  in  which  P(a,b)  is  either 
true  or  false  for  any  ordered  pair  (a,b)  e  Ax  B. 

Thus,  r  is  called  a  relation  from  A  to  B  which  we  will  denote  by 

r  =  (A,  B,  P(x,y)) 

Furthermore,  if  P(a,b)  is  true,  we  will  denote  this  fact  by 

a  r  b 

and  if  P(a,b)  is  not  true,  we  will  denote  this  fact  by 

a  1  b 

Let  r  =  (A,B,P(x,y))  be  a  relation.  We  define  the  solution  set 
R  of  the  relation  r  to  consist  of  the  elements  (a,b)  in  Ax  B  for  which 
P(a,b)  is  true.  That  is, 

R  =  {(a,b)  [  ae  A,  beB,  P(a,b)  is  true} 

Notice  that  R,  the  solution  set  of  relation  r  from  A  to  B,  is  a 
subset  of  A  x  B. 

Let  R  be  any  subset  of  A  x  B.  Then  we  can  define  a  relation 

r  =  (A,  B,P(x,y))  where  P(x,y)  =  "The  ordered  pair  (x,y)  belongs 

to  R".  The  solution  set  of  this  relation  r  is  the  original  set  R. 
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Thus,  to  every  relation  r  =  (A,B,P(x,y))  there  corresponds  a  unique 
solution  set  R  which  is  a  subset  of  A  x  B,  and  to  every  subset  R  of 
Ax  B  there  corresponds  a  relation  r  =  (A,B,P(x,y))  for  which  R  is  the 
solution  set.  Since  this  one-to-one  ^ 01  despondence  exists  between 
relations  r  =  (A,B,P(x,y))  and  subsets  R  of  Ax  B,  we  can  redefine  a 
relation  as  follows: 

A  relation  r  from  A  to  B  is  a  subset  of  Ax  B. 

Although  this  definition  may  appear  somewhat  artificial,  it  has 
the  advantage  that  the  undefined  concepts  of  "open -sentence”  and 
"variable"  are  not  used. 
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2.2  A  Relational  Model  of  Data  Structure 

In  this  section  we  will  present  a  rigorous,  mathematical  model 
for  data  structure.  In  the  course  of  doing  so  we  will  define  a  rather 
large  number  of  sets.  Since  our  motive  for  defining  a  particular  set 
may  not  present  itself  until  other  sets  have  been  defined,  we  caution 
the  reader  of  this  possibility  ahead  of  time. 

We  raise  one  further  word  of  caution:  Several  distinct  sets  may 
be  designated  by  the  same  symbol  (a  capital  Greek  letter)  with  a 
superscript  used  to  differentiate  among  them.  In  general,  sets 
designated  by  a  common  symbol  and  distinguished  in  this  manner  will 
share  some  common  (subjective)  properties.  Thus,  no  special  signi¬ 
ficance  other  than  its  role  as  a  device  for  differentiation  should  be 
attached  to  the  value  of  a  superscript. 

Let  us  now  proceed  with  the  development  of  our  model  for  data 
structure. 

The  intrinsic  structure  ,  or  the  data  structure ,  of  any  collection 
of  n  data  items  may  be  described  in  the  following  manner. 

Let  the  set  A0  consist  of  the  n  data  items  in  question: 

A0  =  {d.  I  i  =1,2,-..  ,n} 
where  d^  is  the  i-th  dat...  item  in  the  collection. 
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Let  the  set  P  (capital  rho)  consist  of  all  relations  of  interest 
in  A0  (i.  e. ,  from  A0  to  A0): 


P  =  {r.lj  =1,2,...  ,kO} 


where  r.  is  the  j-th  relation  and  k°  is  the  number  of  different 
J  J  r 

relations  of  interest. 

When  we  refer  to  the  ’’relations  of  interest”,  we  mean  that  for 
the  purpose  of  solving  a  particular  problem  involving  the  data  items 
of  A0  we  may  assume  that  the  elements  ot  A0  are  related  to  one 
another  only  via  the  relations  in  P  (whereas  these  data  items  may 
actually  be  related  to  one  another  via  other  relations  as  well). 

The  set  A°  and  the  relations  a.  P  then  define  the  intrinsic 
structure  of  the  given  collection  of  n  data  items. 

This  information  in  itself  is  of  little  more  than  academic 


interest,  but  we  may  expand  upon  it  somsvhat  in  order  to  gain 

the  insight  required  of  our  model. 

Let  R.  be  the  solution  set  of  relation  r.  e  P.  R.  then  consists 
3  3  3 

of  a  set  of  ordered  pairs  (d^,  dk)  in  A0  x  A0. 

Let  us  define  a ^  to  be  a  subset  of  A0  such  that  every  element 
of  Aj  is  the  first  element  of  at  least  one  ordered  pair  (d^d^)  in  R^. 
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Further,  let  us  define  n  to  be  a  subset  of  A0  such  that  every  element 
of  n.  is  the  second  element  of  at  least  one  ordered  pair  (d^,d^)  in  R^. 

nj=  {d^  ld^e  A0,  (d^d^eR^  f  "ome  d.e  A0} 


Let 


A  = 


k° 
r 

U  A, 

3=1 


3 


and  let 


n  = 


ko 

r 

u  n 

3=1  3 


Clearly  A  and  n  are  subsets  of  A0 .  In  fact  either  A  or  II  or  both 
may  be  identically  equal  to  A0,  but  this  is  not  necessarily  the  case 
(which  is,  of  course,  our  reason  for  defining  A  and  II), 

The  set  P  may  now  be  defined  to  consist  of  relations  from  A  to  n. 
The  elements  of  P  are  exactly  the  same  as  before. 

As  a  practical  matter  we  will  require  that  A0  =  A  Un„  That  is,  we 
will  require  every  element  of  A0  to  be  related  by  at  least  one  relation 
in  P  to  at  least  one  other  element  in  A0  (not  excluding  the  possibility  of 
an  element  being  related  to  itself).  Since  we  are  concerned  with  the 
interrelationships  among  various  data  items  in  our  consideration  of 
data  structure,  we  find  a  data  item  which  is  related  to  no  other  to  be 
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singularly  uninteresting.  (There  are,  of  course,  some  interesting 
problems  associated  with  the  representation  of  elementary  items  of 
data  within  a  computer  -  such  as,  the  representation  of  rational 
numbers  -  but  these  problems  do  not  concern  us  here.) 

Let  the  cardinality  of  set  A  (which  ve  denote  by  I A  I)  be  k°  and 

a 

let  the  cardinality  of  set  IT  be  k° . 


!  A I  =  k° 
a 


in  i  =k° 

p 

As  a  matter  of  notational  convenience,  let  the  k-th  element  of  n  be 
denoted  byp^^  ,  where  ke  {1,2,  •  •  •  ,k°  }  ,  to  distinguish  it  from  the  k-th 
element  of  a  which  we  will  continue  to  denote  by  d^,  where 
ke{l,2,-°-,k°}. 

We  then  indicate  the  fact  that  some  data  item  d^  in  a  is  related  to 

some  data  item  a  in  n  via  some  relation  r.  in  P  by  the  notation  d.  r.  p  . 

k  ]  i  ]  i 

For  specific  values  of  i,  j,  and  k  we  call  d.  r.  p,^  a  relation  instance. 

I  ]  k  - - 

In  particular,  d.  r.  p^  is  a  relation  instance  if  (d.,  p^)  e  R-.  dt  is 
called  the  source  of  the  relation  instance,  Pk  is  called  the  target  of 
the  relation  instance,  and  r.  is  called  the  relation  symbol  of  the  re¬ 
lation  instance.  Note  that  we  may  use  r.  to  denote  both  a  relation  and 
a  relation  sy  .bol.  Context  should  make  clear,  however,  the  sense  in 
which  r .  is  eing  used. 
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We  now  define  a  set  Q  to  contain  all  the  relation  instances  implied 
by  the  relations  in  P. 


«  =  {dirjPkUdi,Pk)eRj?  j=l,2,***,kO} 

Corresponding  to  each  data  item  d.e  A,  where  ie  j  1  2,*  •  •  k°  }, 

i  *  a. 

there  exists  some  (nonempty)  set  P.  of  relations  such  that 

(1)  Pt  CP,  and 

(2)  for  each  relation  r.e  P.  there  exists  at  least  one  data 

v  1  J  i 

item  p.  e  II,  where  ke  -!  1 , 2,  •  •  ■  ,  k°  } .  for  which  d.  r.  p,  , 
*k  ’  1  ’  p 1  i  j  k 

P.  =  { r .  [ r .e P,  d^c  A,  (d.,  pk)c  R.  for  some  pk  c  N | 


In  other  words,  P,  consists  of  all  relations  m  P  whoso,  solution  sets 

contain  at  least  one  ordered  pair  the  first  element  of  v.hich  is  d.„ 

Corresponding  to  every  source/relal ien  symbol  pair  (d.  r  1.  where 

2 

d.  eA  and  r.e  P.  CP,  there  exists  some  set  11  .  11  of  targets,  where 

l  ]  l  ’  ? 

2 

J0e{l,2,**  *  ,n2},  such  that  for  every  target  p^  11  f  ,  (d.,pkV  R.„  That 
2 

is,  n  which  is  called  the  £-th  target  set ,  consists  of  all  data  items 

o,  e  n  which  satisfy  d,  r.  p.  . 

*k  l  j  rk 

2 

To  present  a  more  formal  definition,  we  define  the  target  set  n~ 
for  Jfe{l,2,- •  •  ,ng}  such  that 
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(i)  n< 


n 


(2)  nj  /  / 


(3)  n2  =  {pk  id..  r.  €  n  for  fixed  d..  and  r^} 


(4) 


Note  that  this  definition  does  not  preclude  the  possibility  of  two  or  more 
distinct  source/relation  symbol  pairs  having  the  same  target  set. 

Let  the  set  2^,  where  & {l, 2, 0 0 •  ,n2},  consist  °f  all  source/ 
relation  symbol  pairs  (d^.  r.),  where  d^e  A  and  ne  such  that 
(dj[,  Pk)  eR.  for  every  pfce  n2. 

Sjg  =  {(dj  r.)  Id.  r.  pk  e  L  for  all  Pj^  e  n2  } 

That  is,  2£  consists  of  all  source/relation  symbol  pairs  which  have 

n  as  target  set0  We  note  that  there  is  a  one-to-one  correspondence 

between  the  elements  of  the  set  {  2^  I  £  =  1,2,  •  »  *n2}  and  the  elements 

of  the  set  I  St  =  l,2,,,on  }.  For  convenience  we  have  assumed 

2 

that  2  f  corresponds  to  n  ^  for  all  k  {1,  2, » •  •  ,n2}0 

Let  the  set  n  be  partitioned  into  a  number  n^  £  k°  of  disjoint 
subsets  11^,  where  m  =  l,  2, 0  •  -n^,  such  th£ 


(1)  n1  O  n1  =  0  form,  /  m2 

*  U 


n. 


(2)  II  =  U  1  n1 

m  -  1  m 
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(3)  W'  nm<^= 

l,2r~=-\r-  n5e> 

for  ail  J?e{l,2,*«*n2} 

The  main  thrust  of  this  definition,  as  contained  in  condition  3,  is 
the  following.  If  targets  p  and  p  are  both  elements  of  the  set 

ki 

1  2  1  ^ 

II  ,  then  for  every  set  II  .  of  v/inch  p  is  an  element,  p  is  also  an 
m  ^  ^  K£ 

element,  and  for  every  set  11^  of  which  p,  js  an  element,  p,  is  also 
’  *2  k2  kl 

an  element.  Conversely,  if  for  every  set  II2  oi  which  pi.  is  an 

h  ki 

element  p^  is  also  an  element  and  if  for  every  sc-1  II2  of  wnich  p^ 

is  an  element  p^  is  also  ?r.  elemen* .  then  p^  and  p^  art  both  element 
1  12 

of  the  same  set 

o 

We  note  that  for  each  set  n  ^  here  k  J 1. 2,  •  •  ,n2! ,  there-  exists 
a  set  of  indices  such  that 


f  {1,2, 


[|2  ■  i  n1 

t  m 

me.  M  j, 


On  an  intuitive,  basis  we  can  view  the  sets  Ir  as  the  land  s'  sub- 

m 

sets  into  which  II  can  be.  partitioned  while  still  allowing  th*  construction 
2 

of  each  II ^  as  a  union  of  some  of  them.  Obviously,  we  can  partition 
II  into  k°  subsets,  each  of  which  contains  exactly  one  data  item. 

p  ’ 
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2  • 

Suppose,  however,  that  for  every  target  set  II  ^  in  which  data  item 


can  be  combined  into  a  single  set,  thereby  reducing  the  number  of 

subsets  to  kp-1.  Continuing  in  this  manner  we  eventually  reach 

the  point  at  which  no  two  targets  in  II  appear  together  in  any  target 
2 

set  II  in  which  one  or  the  other  appears  but  are  not  elements  of  the 

same  subset  of  the  current  partition„  At  this  point  we  cannot  further 

reduce  the  number  of  subsets  in  the  partition  and  these  subsets  define 

the  various  II  * . 

m 

Q 

Let  the  set  Ir^,  where  me  {l,  2,  °  °  •  n ^ } ,  consist  of  all  those  target 
2  1 

sets  n,  of  which  II  is  a  subset,, 

•*-  m 


n3  -  (n3  i  n*  c  1 

3  2  1 

n  simply  indicates  which  of  the  target  sets  II .  ’’use"  the  subset  H  . 
m  ^ J  &  i  m 

Given  some  data  item  d^e  A,  for  every  i\  e  P.,  there  exists  some 

2  2 
corresponding  target  set  II ^  (such  that  (d^,p^)e  for  each  p^e  n 

Let  the  set  r consist  of  all  distinct  relation  symbol/target  set  pairs 

(r.H?)  which  are  associated  with  all  sources  in  a  such  that 
.1  f 

(1)  r={(r.n3)l  d.r.  n^e  Q  forsomed.cA} 

<2>  (',"^.(*,"1^  r,n*  - 

'J  d  d  g  A,  i  /  i  such  that  d  r  U2  d  r  II3  e  n 

1  *2  1  1  h  3  £1  *2  3  f2 
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where  cL  r.  n  ^  represents  all  relation  instances  cL  r^  p^  such  that 

Pke  n^c  Clearly,  lri<kV0 

It  may  be  the  case,  of  course,  that  the  same  relation  symbol/ 

2 

target  set  pair  is  associated  with  more  than  one  source.  Let  A^  be 

2 

the  set  of  all  sources  in  A  with  which  the  q-th  element  (r.  II  )  of  r 

J 

is  associated,  where  qe  { 1, 2,  •  °  • ,  ITI}. 

Let  us  define  a  one-to-one  function  a  which  assigns  to  each  ele¬ 
ment  of  fa  unique  element  of  the  set  {1, 2, » •  • ,  Irl}. 


a  :  r-  {1,2, ,  irf} 


(Clearly,  a  maps  ronto  the  set  {1,2,°“  IT!}.)  We  may  use  the 

2 

function  cr  to  assign  an  index  value  q  to  each  element  (r.n  )  of  H 

J  ■*“ 

q  =  a(r  n^) 


Similarly,  we  may  use  the  inverse  a  of  the  function  a  to  determine 
2 

the  element  (r.n  )  of  r corresponding  to  an  index  value  q: 

J  * 

(r^  =  o'\  q) 


Then  the  set  a  may  be  defined  more  rigorously  as  follows: 
Aq  ditA-  Vj'lfe  SJ,  (r.ll^)  =  a-1  (q)  } 


where  q  e  [1,2.*** ,  :  Fi 
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Let 


a  =  {a^  I  q  =  i,2,---,  in} 

2  2  2  2 

There  may  exist  sets  A  and  A  e  A  such  that  A  =  A„  for 

ql  q2  ql  q2 

VV 

Therefore,  let  us  partition  the  set  A  into  a  number  n^<  !ri  of  sub¬ 
sets,  each  of  which  consists  of  a  collection  of  identically  equal  elements. 
Let  us  then  choose  one  element  from  each  of  the  subsets  to  form  a 
new  set  A*  of  distinct  elements.  In  particular,  let  us  define  the  set 
A*  as  follows: 


(1) 

(2) 

We  note  that 


AJ 

^  for  A^e  A.  and  m,  4  i»2 

I  A*  I  =  n  . 

4 


2 

For  each  A^  e  A*  we  may  then  define  a  set  r  c  r which  consists 
m  n; 

9 

of  all  relation  symbol /target  set  pairs  associated  with  all  A  e  A  for 

Q 

2  2 

which  A  =  A  .  Specifically,  we  define  the  set  r  for  me  {1, 2,  •  •  •  ,n  .} 
q  m  ’  m  1  ’  ’  *  4J 

su  '  that 


(1)  r  /  0 

(2)  rm  =  {(rjn2£)  1  (rjn2^€  r’dirjnf  6  n  for  a11  di€  Ame  A*f 

(3)  T  /  T  if  m,  /  m0 

'  m,  mn  1  2 
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Now  in  the  same  manner  as  we  did  for  II  ^  let  us  partition  the  set 
A  into  a  number  n_  <  k°  of  disjoint  subsets  aJ,  where  f  =  1  2,* « *nq, 

u  3.  x,7 

such  that 


(1)  aJ  PlAj  .0  for  Jtj/i, 

1  2 

(2)  A.  Aj 


(3)  d  ,d.  £  aJ 

h  1 2  * 


(d.  e  A*  ' 
v  ij  m 


>  d.  e  A  ) 
l  m' 


for  all  m  g  {1, 2, •  •  •  ,n.} 


(i.e. ,  for  all  A^  e  A*) 


It  follows  that  for  each  set  Aj^  e  A* ,  where  me  {1, 2,  *  °  •  ,n4) , 

there  exists  a  set  L  of  indices  such  that 

m 

L  c{l  2,  *  *  *  ,n0} 
m  L  ’  ’  ’  3J 


a2=U  a! 

nl  feL  1 
m 


Fmally ,  let  the  set  A^,  where  k  {l,  2,  •  •  •  ,n3} ,  consist  of  all  those 
2  1 

sets  A  e  A*  of  which  A is  a  subset, 
m  ft 
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o  2  1 

A  ^  simply  indicates  which  of  the  sets  Am  "use”  the  subset  A^. 

2.2.1  Schematic  Representation 

Our  model  for  the  data  structure  of  an  arbitrary  collection  of 

1  3 

elementary  items  of  data  now  consists  of  the  sets  A,  n,  P^,  A^, 

A^  ,  r  .  S,,.  n?,  n3  and  n1  as  well  as  our  original  sets  A°  and  P0 
In  order  to  clarify  the  relationships  among  these  sets,  we  intro¬ 
duce  the  schematic  representation  of  Figure  2-1.  We  may  view  this 
structure  as  an  undirected  graph  consisting  of  certain  nodes,  or 
vertices,  connected  by  (undirected)  branches,  or  edges. 

Each  ring  of  the  structure  of  Figure  2-1  contains  some  number 
k°+  1  of  nodes  (where  the  value  of  k°  varies  from  ring  to  ring).  Of 
these  k°+  1  nodes  k°  act  in  a  ’'connective"  capacity  and  the  remaining 
node  represents  some  set.  In  particular,  for  any  ring  in  the  structure 
those  nodes  of  the  ring  which  lie  on  a  level  designated  by  an  odd  number 
are  the  connective  nodes  and  that  node  which  lies  on  a  level  designated 
by  an  even  number  is  the  set  node.  In  a  general  sense,  we  may  inter¬ 
pret  the  connective  nodes  of  a  ring  as  indicating  the  elements  of  the 
set  represented  by  the  set  node  of  the  ring. 

Before  proceeding  with  the  interpretation  of  the  structure  of 
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CM  fO  < 


Figure  2-1  let  us  point  out  again  the  one-to-one  correspondence  which 
exists  between  sets  of  certain  types.  For  each  set  A^  ,  where 

q 

l  e  {l,  2,  • 0  ’n^},  there  is  a  corresponding  (unique)  set  A^.  Similarly, 
o 

for  each  set  a^,  where  me  {l,2,°  °  'n^},  there  is  a  corresponding  set 
rm;  for  each  set  n^,  where  £e  {1, 2,  •  •  •  ng} ,  there  is  a  corresponding 
set  and  for  each  set  n^f  where  me{l,2,**  •  ,n^},  there  is  a 

3 

corresponding  set  II  . 

In  the  structure  of  Figure  2-1  each  node  at  level  2  (which  is  a  set 

1  3 

node)  represents  a  pair  of  sets  (A^,  A^)  for  jfe{l,  2,  •  •  • 

2 

Similarly,  each  node  at  level  4  represents  a  pair  of  sets  (Am,r  )  for 

me  {  1, 2,  •  -  •  ,n4};  each  node  at  level  6  represents  a  pair  of  sets 

<=!•  n^)  for  &{l,2, ••*,n2};  and  each  node  at  level  8  represents  a 
3  1 

pair  of  sets  (11^,  IT^)  for  me  {l,  2,  *  •  •  ,n^}„  For  each  of  these  set 
pairs  the  ring  above  the  corresponding  node  (i„e. ,  the  connective 
nodes  in  that  ring)  indicates  the  composition  of  the  first  set  in  the 
pair,  and  the  ring  below  that  node  indicates  the  composition  of  the 
second  set  in  the  pair. 

Each  node  at  level  1  represents  some  distinct  data  item  d.e  A  and 
each  node  at  level  9  represents  some  distinct  data  item  p^ell,  with  the 
result  that  the  set  A  is  represented  by  the  entire  collection  of  nodes  at 
level  1  and  the  set  n  is  represented  by  the  entire  collection  of  nodes 
at  level  9. 

1 

For  a  given  set  A^;  which  is  represented  by  some  node  at  level  2, 
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the  elements  contained  therein  are  represented  by  those  nodes  at 
level  1  which  are  contained  within  the  ring  associated  with  the  given 
node  at  level  2. 

Similarly,  for  a  given  set  which  is  represented  by  some  node 
at  level  8,  the  elements  contained  therein  are  represented  by  those 
nodes  at  level  9  which  are  contained  within  the  ring  associated  with 
the  given  node  at  level  8. 

2 

Next  consider  some  set  A  ,  which  i  s  represented  by  a  node  at 

2 

level  4.  The  elements  of  A  are  represented  by  those  nodes  at  level  1 

m 

which  are  associated  with  the  nodes  at  level  2  which  have  nodes  at  level 

3  in  common  with  the  ring  associated  with  the  node  (at  level  4)  repre- 

2  1 
senting  Am-  Stated  somewhat  differently,  the  sets  A^  which  are  sub- 

2  2 
sets  of  Am  (and  which  collectively  contain  the  elements  of  Am)  are 

represented  by  those  nodes  at  level  2  which  share  (in  their  rings  that  pass 

through  level  3)  nodes  which  are  members  of  the  ring  that  passes 

2 

through  level  3  and  is  associated  with  the  node  which  represents  a  . 

We  note  that  any  two  rings  which  have  at  least  one  node  in  common 

have  in  fact  exactly  one  node  in  common. 

2 

Each  set  II  which  is  represented  by  some  node  at  level  6,  is 

2 

treated  at  levels  6,7,8  and  9  in  a  manner  analogous  to  the  sets  A  . 

3 

Consider  next  some  set  A which  is  represented  by  anode  at 

3 

level  2.  The  elements  of  A^  are  represented  by  nodes  at  level  4 
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2 

(which  represent  the  sets  Am)  having  rings  passing  through  nodes  at 
level  3  which  are  members  of  the  ring  associated  with  the  node 
representing  A^. 

3 

Again,  the  analogous  situation  applies  for  each  set  at  levels 
6, 7, and  8. 


The  only  difference  between  the  situations  described  above  and 

that  for  the  sets  r  and  2.  is  that  the  connective  nodes  at  level  5 
m  £ 

represent  relation  symbols  in  addition  to  performing  their  connectivity 
functions.  For  example,  given  some  set  r  ,  which  is  represented 
by  a  node  at  Wei  4,  each  element  of  this  set  is  represented  by  a  node 
at  level  5  (representing  a  relation  symbol)  and  a  node  at  level  6 
(representing  a  target  set  n^)  which  node  has  a  ring  passing  through 
the  relation  symbol  node  at  level  5. 

It  follows  also  that  each  relation  r^e  P  (or  rather  the  ordered  pairs 
(d- ,  p, )  which  form  the  corresponding  solution  set  R.)  is  represented 

by  those  pairs  of  nodes  at  level  1  and  9  which  are  associated  with  the 
nodes  at  level  5  representing  the  relation  symbol  r^.  (Note  that,  in 
general,  there  may  be  more  than  one  node  at  level  5  which  represents 
the  same  relation  symbol  -  in  different  relation  instances,  of  course.) 


2.2.2  Implications  of  the  Model 

Clearly,  there  is  more  information  by  the  collection  of  sets  in 
our  data  structure  model  than  we  have  explicitly  discussed  in  our 
initial  description  of  the  schematic  of  Figure  2-1. 
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9 

For  example,  the  set  A^,  where  fe{l,2,*  •  •  ,ng},  indicates  which 

sets  A2  contain  the  set  A*  as  a  subset.  However,  since  for  every 
m  £ 

set  A^,  where  me  {1, 2,  *  ■  •  ,n4},  there  exists  a  corresponding  set  rm  , 

3 

the  set  A  „  also  indicates  ail  sets  r  which  are  associated  with  the  set 
£  m 

1  3 

A^.  Hence,  the  set  A^  indicates  (indirectly)  all  relation  symbol/target 
set  pairs  which  are  associated  with  the  sources  in  the  set  A^. 

9 

In  a  similar  manner  the  set  11°  where  me  {1,2, °°°n  },  indicates 
(indirectly)  all  source  /  relation  symbol  pairs  which  are  associated  with 
the  targets  of  the  set  n^„ 

We  can,  of  course,  continue  'ith  this  line  of  reasoning  to  determine 
which  targets  are  associated  with  a  particular  source/relation  symbol 
pair,  or  which  sources  are  associated  with  a  particular  relation  symbol/ 
target  pair,  and  so  forth.  In  short,  given  the  sets  defined  for  our  data 
structure  model,  we  can  determine  any  fact  concerning  the  inter¬ 
relationships  of  the  elements  of  a  given  collection  of  elementary  data 
items. 

2.2.3  Uniqueness  Considerations 

In  defining  the  sets  of  our  data  structure  model,  we  have  implicitly 
made  a  very  important  decision.  If  we  examine  the  union  of  all  sets 
Tm  associated  with  a  given  source  d^  (as  given  by  the  set  A^  associated 
with  the  set  A*  of  which  the  given  source  is  an  element),  we  find  that 
in  each  of  the  relation  symbol/target  set  pairs  the  relation  symbol  is 
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unique.  In  fact,  there  is  a  one-to-one  correspondence  between  the 
elements  of  this  union  and  the  elements  of  P^  To  be  more  specific, 
if  we  let 

if  =  {(r.n2£)!  (r.n2)  e  r,  d,  r.  n2  £  a,  d.  £ 
or  alternatively 


r°  =  u  r 

1  meM°  m 


o  2 

where  =  {m  I  d^e  A^  e  A*},  then  thefollowing  conditions  will  hold: 

(1)  (rjnpf-  r?<==^r.  e  P.  for  i  e{l,  2, «* *  •  ,k°} 

(2)  r  /  r  for  (r  Tl2  ),  (r  II*  )  e  r°  and  j  /  j 

h  ]2  31  h  J2  *2  1  1  £ 


On  the  other  hand,  if  we  examine  the  union  of  all  sets  2^  associated 
with  a  given  target  p^,  we  find  that  in  each  of  the  source  set*/relation 
symbol  pairs  the  relation  symbol  is  not  necessarily  unique.  That  is 


if  we  let. 

rj)iAm  "jV  S2-Pke  n> 

2 

then  it  is  not  necessarily  the  case  that  r.  /  r.  for  (A  ,  r.  ), 

>1  >2  “l  '1 

(Am2rjf  4h- 

*  A  "source  set"  is  roughly  analogous  to  our  notion  of  a  target  set. 
Whereas  the  sets  are  target  sets,  the  sets  Am  niay  he  considered 
source  sets0  Strictly  speaking,  the  elements  of  a  set  2^  are  source/ 
relation  symbol  pairs  (d.r.)  and  not  source  set/relation  symbol  pairs 
(Am  r,).  1  ' 
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For  ease  of  reference  we  will  say  that  this  set  of  conditions 
implies  uniqueness  of  type  1  . 

We  may,  of  course,  wish  to  consider  the  opposite  set  of  conditions, 
where  the  relation  symbols  in  the  set  of  source  set/relation  symbol 
pairs  associated  with  a  given  target  p^  are  unique,  and  where  the 
relation  symbols  in  the  set  of  relation  symbol/target  set  pairs  asso¬ 
ciated  with  a  given  source  d.  are  not  necessarily  unique.  We  will  say 
that  this  set  of  conditions  implies  uniqueness  of  type  2.  Redefinition 
of  the  sets  of  our  data  structure  model  to  effect  uniqueness  of  type  2  is 
straightforward  and,  therefore,  will  not  be  done  here. 

It  should  be  clear  that  we  cannot  require  both  uniqueness  of  type  1 
and  uniqueness  of  type  2  simultaneously  except  in  very  special  cases. 

It  should  also  be  clear  that  it  is  not  desirable  to  require  "non-unique¬ 
ness’'  of  both  types  simultaneously ,  for  in  this  case  our  sets  and, 
hence,  the  model  cease  to  be  uniquely  defined. 

We  assume  therefore,  that  the  "problem  solver"  must  specify 
which  type  of  uniqueness  he  assumes  when  he  describes  the  data  for 
his  problem  in  terms  of  our  data  structure  model. 

Since  the  principles  involved  are  no  different  whether  considering 
uniqueness  of  type  1  or  of  type  2,  unless  otherwise  indicated  we  will 
assume  uniqueness  of  type  1  to  be  in  effect  for  the  remaindei  of  the 
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discourse*. 


In  light  of  this  discussion  we  can  comment  further  upon  the  re¬ 
lation  symbols  represented  by  the  nodes  at  level  5  of  the  structure  of 
Figure  2-1. 

The  nodes  designated  r.  ,  r.  ,  and  r  (and  ostensibly  representing 

31  32  h 

the  relation  symbols  r.  ,  r.  ,  and  r.  )  must  represent  distinct 

31  J2  33 

relation  symbols  since  these  relation  symbols  appear  in  relation  symbol/ 
target  set  pairs  which  are  all  elements  of  the  same  set  r°  corre- 

h 

spending  to  data  item  d.  . 

It  is  also  true  that  the  relation  symbols  represented  by  the  nodes 

designated  r.  and  r.  must  be  distinct,  but  for  a  different  reason. 

2  34 

In  particular,  since  both  relation  symbols  are  associated  with  the 

same  target  set,  equality  of  these  relation  symbols  would  imply  a 

single  element  of  the  set  r  and,  hence,  a  single  node  in  the  structure. 

On  the  other  hand,  the  relation  symbol  represented  by  the  node 

designated  r,  need  not  be  distinct  from  either  of  the  relation  symbols 
3  5 

represented  by  the  noies  designated  r.  and  r.  in  spite  of  the  fact 


h 


that  all  three  relation  symbols  appear  in  source  set /relation  symbol 

pairs  which  are  elements  of  the  set  6,  corresponding  to  data  item 

K1 

pk,- 


Note  that  the  prototype  program  to  which  we  will  refer  later  in  this 
dissertation  has  provision  for  considering  both  types  of  uniqueness. 
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In  general,  the  relation  symbol  for  node  r.  need  not  be  distinct 

J5 

from  those  for  nodes  r.  and  r.  ,  either.  Finally,  the  relation  symbol 

J1  33 

for  node  r.  need  not  be  distmct  from  that  for  node  r.  . 

J3  h 

This  completes  the  development  of  our  model  for  data  structure. 

2.3  Use  of  the  Model 

Now  that  we  have  a  model  for  data  structure,  we  must  specify  the 
manner  in  which  it  is  to  be  used.  Specific  details  will  be  considered 
in  succeeding  chapters,  but  we  will  provide  here  certain  basic  aspects 
of  the  use  of  the  model. 

The  problem  solver  (i.e. ,  the  individual  who  is  confronted  with 
a  prob’em  and  charged  with  obtaining  its  solution)  must  specify  what 
(or  rather,  how  many)  data  items  and  what  (again,  how  many)  relations 
are  involved  in  the  solution  of  his  problem.  The  particular  data  items 
and,  to  a  much  greater  extent,  the  particular  relations  which  the  pro¬ 
blem  solver  chooses  to  characterize  his  problem  are  influenced  very 
strongly  by  his  choice  of  access  processes. 

For  example,  suppose  the  problem  to  be  solved  involves  (among 
others)  three  classes  A,B,  and  C  of  data  items.  Furthermore,  suppose 
there  exists  a  one-to-one  correspondence  fromthe  elements  of  each  of 
these  classes  onto  the  elements  of  each  of  the  other  two.  Finally, 
suppose  th^t  given  an  element  of  one  of  these  classes,  the  corresponding 
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elements  of  the  other  two  classes  are  to  be  determined  (i.e. ,  accessed) 
Assuming  that  we  are  given  an  element  a  e  A  as  the  intial  starting 
point,  we  have  three  choices  of  procedures  for  determining  the  corre¬ 
sponding  elements  b  e  B  and  c  e  C: 

Procedure  1 

(1)  Given  a  e  A,  determine  b  e  B. 

(2)  Given  be  B,  determine  c  e  C. 

Procedure  2 

(1)  Given  ae  A,  determine  c  e  C. 

(2)  Given  c  e  C,  determine  be  B. 

Procedure  3 

(1)  Given  ae  A.  determine  b  <;  B» 

(2)  Given  ae  A,  determine  c  e  C„ 

It  follows  that  we  have  (at  least)  three  choices  of  relations  which  we 
may  define.  If  procedure  1  is  selected  to  access  the  various  data  items, 
we  may  define  a  relation  r&b  from  A  to  B  and  a  relation  rfec  from  B  to  C; 
if  procedure  2  is  selected,  ve  may  define  a  relation  r  ^  from  A  to  C  and 
a  relation  r  ^  from  C  to  B;  and  finally,  if  procedure  3  is  selected,  we 
may  define  a  relation  r  from  A  to  B  and  a  relation  r  from  A  to  C. 

o.D  cLC 

Although  this  is  a  very  simply  example,  the  concepts  which  it 
embodies  carry  over  to  much  more  complex  situations. 
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By  examining  his  data,  the  problem  solver  must  also  determine  the 
cardinalities  of  the  various  sets  in  our  data  structure  moaei. 

Actually,  the  problem  solver  need  not  determine  the  cardinality 
of  each  instance  of  a  set,  however,  but  need  only  determine  the  average 
cardinality  of  each  type  of  set.  For  example,  rather  than  determine 
the  actual  cardinality  of  the  set  A^  for  each  f  e  {1, 2, » •  °  ,n^},  the  pro¬ 
blem  solver  need  only  determine  the  average  cardinality  of  the  set 
A*  over  all  £  e  {1, 2,  •  •  *  ,n<j}.  The  reasononly  the  average  cardinalities 
are  required  should  become  clear  in  the  next  chapter. 

There  is,  of  course,  a  very  good  reason  why,  from  the  problem 
solver's  point  of  view,  actual  cardinalities  are  not  desirable.  Namely, 
for  data  str  ictures  of  even  moderate  size  these  cardinalities  may  be 
extremely  difficult,  if  not  impossible,  to  determine.  In  order  to  do  so, 
one  vould  prohably  'cave  to  construct  a  structure  like  i.hat  of  Figure  2-1 
for  the  entire  data  structure.  On  the  other  hand,  as  we  shall  see  in 
Chapter  VI,  determining  average  cardinalities  for  the  data  structure 
sets  is  a  relatively  painless  task. 

We  might  point  out,  however,  that  there  unfortunately  exists  no 
universal  technique  which  may  be  applied  to  all  problems  to  determine 
the  average  cardinalities  of  the  data  structure  sets.  The  process  which 
must  be  used  is  very  much  a  function  of  the  particular  problem  and  the 
availability  of  information  to  the  problem  solver.  Nevertheless,  an 
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example  we  consider  in  Chapter  VI  should  indicate  some  general  guide¬ 
lines. 

It  may  happen  that  the  mathematical  variance  of  the  actual 
cardinalities  for  a  particular  type  of  set  is  very  large,  in  which  case 
one  might  justifiably  question  the  use  of  an  average  value.  This  is 
particularly  t  rue  if  the  cardinalities  are  characterized  by  a  bimodal 
distribution  for  which  the  average  value  characterizes  neithe  r  of  the 
peaks. 

Such  a  case  would  normally  arise  where  the  set  of  data  items  may 
be  partitioned  into  two  or  more  classes  among  which  there  is  no  inter¬ 
action.  For  example,  suppose  that  the  set  of  data  items  consists  of  the 
(disjoin*-'  sets  A,  B,  C,  D,  and  E  of  data  items.  Zvppose  further  that 
the  elements  of  these  sets  are  related  by  the  following  relations:, 

r  ,  from  A  to  B 
ab 

r,  from  D  to  A 
da 

r  from  C  to  E 
ce 

Clearly,  then,  there  is  no  "communication"  between  any  element  of  the 
sets  A,  B,  and  D,  and  any  element  of  the  sets  C  and  E  (although  elements 
within  the  cluster  of  sets  A,B,  and  D  do  communicate  from  one  set 
to  another,  as  do  elements  within  the  cluster  of  sets  C  and  E).  We 
may  then  partition  the  set  of  data  items  into  two  noncommunicating 
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classes  consisting  of  the  elements  of  sets  A,  B,  and  D  and  the  elements 
of  sets  C  and  E,  respectively. 

We  may  treat  this  situation  as  consisting  of  two  (or  more,  as  the 
case  may  be)  distinct  problems  for  which  separate  data  structures 
may  be  defined.  In  a  general  sense,  we  may  do  this  even  where  the 
boundaries  between  the  classes  are  not  so  well  defined. 

Clearly,  the  decision  to  partition  a  problem  into  sub-problems 
lies  with  the  problem  solver.  For  our  purposes,  we  may  assume 
simply  that  the  data  structure  given  is  the  only  one  of  concern. 

Once  the  average  cardinalities  for  the  various  data  structure 
sets  have  been  specified  by  the  problem  solver,  we  apply  the  techniques 
which  are  developed  in  the  next  three  chapters  to  determine  the  best 
storage  structure  for  that  data  structure. 


53 


Chapter  HI 

A  MODEL  OF  STORAGE  STRUCTURE 

In  the  previous  chapter  we  developed  a  mathematical  model  for 
the  intrinsic  structure  of  an  arbitrary  collection  of  elementary  data 
items.  Our  goal  in  this  chapter  is  the  development  of  a  model  for 
all  storage  structures  capable  of  representing  the  data  structure 
of  that  collection  of  data  items. 

We  desire  a  model  which  can  assume  the  form  of  any  of  the 
three  basic  organizations  -  sequential,  list,  and  random  -  or 
combinations  thereof.  We  also  desire  a  model  which  is  capable  of 
representing  the  data  structure  of  a  collection  of  data  items  in 
varying  degrees  of  detail. 

Let  us  assume  that  all  storage  structures  to  be  considered  will 
be  resident  in  a  uniform  storage  medium, each  unit  of  which  (such  as 
a  byte  or  word)  can  be  accessed  in  the  same  number  of  time  units 
as  any  other  unit.  This  type  of  storage  medium  is  commonly  called 
random  access  storage.  The  classical  example  of  random  access 

storage  is,  of  course,  magnetic-core  memory. 

We  will  also  make  what  might  be  called  a  storage  management 

disclaimer.  We  assume  that  there  exists  some  mechanism  for  the 
management  of  the  storage  in  which  our  storage  structure  resides, 
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that  is,  some  overseer  which  keeps  track  of  those  storage  locations 
available  for,  but  not  actually  occupied  by.  the  structure.  This 
storage  supervisor  is  charged  with  providing  upon  request  unoccupied 
storage  locations  for  use  by  the  storage  structure  and  is  also  charged 
with  reclaiming  those  storage  locations  no  longer  in  use  by  the 
structure.  Although  the  implementation  of  these  functions  is  of 
considerable  importance,  such  considerations  do  not  fall  within  the 
scope  of  our  efforts  here.  Therefore,  beyond  acknowledging  the 
existence  of  a  storage  manager  and  acknowledging  its  importance 
in  the  implementation  of  any  storage  structure, we  shall  ignore  it. 
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3. 1  Initial  Storage  Structure  Model 


Examine  our  data  structure  model  as  it  appears  in  Figure  2-1, 

As  the  first  step  in  the  development  of  a  storage  structure  model, 
let  us  develop  a  single  storage  structure  capable  of  representing  all 
the  detail  of  the  data  structure  of  this  model. 

Let  each  node  of  the  DSM  (the  Data  Structure  Model  of  Figure  2-1) 
be  represented  by  a  block  of  contiguous  storage  units  (bytes,  words, 
etc.)  and  let  each  ring  of  the  DSM  be  represented  by  a  ring  of  (list) 
pointers  where  the  unique  block  of  the  ring  (i.e. ,  the  block  which 
represents  a  node  at  level  2,4,6,  or  8  of  the  DSM)  acts  as  the  head 
of  the  ring. 

For  every  pointer  thus  appearing  in  a  ring  let  us  include  a 
pointer  in  the  opposite  direction,  resulting  in  a  ring  linking  all  nodes 
in  the  direction  opposite  to  the  first  ring.  We  will  distinguish  these 
two  types  of  rings  by  saying  the  first  consists  of  forward  pointers 
and  the  second  consists  of  reverse  pointers  (although  the  distinction 
as  to  which  ring  contains  forward  pointers  and  which  reverse 
pointers  is  purely  academic).  At  times  we  may  also  refer  simply 
to  "pointers"  (other  than  in  the  generic  sense)  in  which  case  we  will 
mean  forward  pointers. 

Let  us  also  include  in  every  ring  a  pointer  from  each  element 
block  (as  distinct  from  the  head)  to  the  head.  This  pointer  we  will 
call  a  head  pointer. 
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At  this  point  the  blocks  of  our  storage  structure  consist  only  of 
pointer  fields.  We  will,  of  course,  have  to  include  information  about 
the  data  items.  Let  this  information  be  placed  in  a  number  of  data 
item  description  blocks  ,  one  for  every  elementary  data  item.  Then 
let  each  block  which  represents  some  data  item  (i.e. ,  each  block 
which  corresponds  to  a  node  at  level  1  or  level  9  of  the  DSM)  contain 
a  pointer  to  the  corresponding  data  item  description  block.  Each 
description  block  (short  for  data  item  description  block)  will  then  be 
pointed  to  by  at  least  one,  but  no  more  than  two,  description  block 
indicators. 

In  each  block  representing  a  node  at  level  5  of  the  DSM  we  will 
include  a  field  to  contain  the  name  (some  appropriate  code)  of  the 
relation  symbol  which  the  node  represents.  Finally,  through  each 
block  containing  the  name  of  a  given  relation  symbol  we  will  pass 
a  ring  joining  all  blocks  containing  the  name  of  that  particular 
relation  symbol.  Clearly,  the  number  of  such  rings  will  be  equal  to 
the  cardinality  of  P.  In  fact,  there  will  be  a  one-to-one  correspond¬ 
ence  between  these  rings  and  the  relations  in  P .  Each  of  these 
rings,  which  we  designate  relation  rings,  will  have  a  head  which  is 
distinct  from  the  blocks  representing  nodes  at  level  5  of  the  DSM. 

The  resultant  storage  structure,  sans  reverse  and  head  pointers, 

is  shown  schematically  in  Figure  3-1.  We  have  assumed  for  the 

purposes  of  illustration  that  the  nodes  designated  r.  and  r.  in  the 

]3  h 
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Initial  Storage  Structure  Model 


DSM  represent  the  same  relation  symbol  r^  .  We  will  designate  the 
structure  of  Figure  3-1  as  the  Initial  Storage  Structure  Model,  or 
simply,  the  ISSM.  Those  pointers  emanating  from  the  blocks  and 
the  bjj  blocks  and  followed  by  terms  of  the  form  d.  and  p^, 
respectively,  represent  the  description  block  indicators.  (The 
description  blocks  are  not  shown.) 

3. 2  Transformations 

Instead  of  the  structure  described  above,  we  could,  of  course, 
have  chosen  a  storage  structure  in  which  no  head  pointers  are  present, 
or  one  in  which  no  reverse  pointers  are  present,  or  one  in  which 
head  pointers  are  present  only  for  certain  rings,  or  one  in  which  some 
rings  are  replaced  by  stacks,  and  so  on. 

Clearly,  our  storage  structure  model  must  be  flexible  enough 
to  allow  us  to  represent  variations  of  this  sort.  We  will  achieve  this 
flexibility  by  applying  certain  transformations  to  the  ISSM  to  yield 
the  storage  structure  of  interest. 

These  transformations,  which  may  be  applied  individually  and 
in  combination  to  the  ISSM,  are  called 

(1)  Stacking 

(2)  Duplication 

(3)  Elimination 

Before  considering  the  effect  of  each  of  these  transformations 
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upon  the  1SSM  note  the  symmetry  and  the  repetitive  nature  of  that 
structure.  The  blocks  designated  (by  type)  b b^,  bg,  and  b^  in 
Figure  3-1  each  act  as  the  head  of  two  rings,  one  above  and  one 
below  the  block.  On  the  other  hand,  the  blocks  designated  bg,  bg. 
and  bg  each  act  as  an  element  in  each  of  two  ,rback-to-backM  rings. 

The  blocks  designated  b^  and  b^  also  act  as  ring  elements  but 
appear  in  only  one  ring. 

Thus,  the  ISSM  is  composed  essentially  of  repetitions  of  the 
structure  of  Figure  3-2.  The  blocks  designated  andXg  (specifi¬ 
cally,  the  blocks  containing  x^,  x^,  xgj>  and  Xgg)  function  as  the 
heads  of  two  rings  each  (only  one  of  which  is  shown  in  del  ail)  and  the 
blocks  designated  x,,  (specifically  the  blocks  containing  x^,  x^, 
and  x2g)  each  function  as  elements  of  back-to-back  rings. 

To  simplify  further  discourse,  instead  of  writing  "the  blocks 
designated  (by  type)  x^"  we  will  write  "the  x^  blocks"  and  instead  of 
writing  "the  block  containing  x^"  we  will  write  simply 

Consider  now  the  effect  of  each  of  the  three  transformations 
given  above  upon  the  structure  of  Figure  3-2. 

The  "stacking"  transformation  causes  a  given  ring  to  be  turned 
into  a  stack.  For  instance,  if  we  apply  the  stacking  transformation 
to  the  z^ -rings  of  Figure  3-2,  then  for  each  z^-ring  we  form  a 
stack  of  x^-blocks  (which  act  as  elements  of  the  given  z^ring)  upon 
the  corresponding  Xj -block  (which  acts  as  head  of  the  given  z^-ring). 


60 


Similarly,  if  we  apply  the  stacking  transformation  to  the  z2~  rings, 
then  for  each  z_-ring  we  form  a  stack  of  x.-blocks  upon  the  corre  - 
sponding  x^ -block.  Note  that  since  the  x^-blocks  are  shared  by  both 
the  z^-rings  and  the  Zg-rings,  the  Xg-blocks  may  he  stacked  upon 
either  the  x^ -blocks  or  the  x^ -blocks  but  not  both.  Figure  3-3 
illustrates  this  transformation  as  applied  to  the  z^-rings  of  Figure  3-2. 

An  alternative  to  stacking  is  the  "duplication”  transformation. 
Applying  this  transformation  to  a  ring  causes  a  copy  of  the  bead  of  the 
ring  to  be  concatenated  with  each  element  block  of  the  ring.  In 
addition,  for  each  copy  of  the  head  block  which  is  created  by  this 
transformation,  a  copy  of  the  second  ring  of  which  the  block  is  head 
is  created.  The  net  result  is  the  removal  of  a  given  ring  from  the 
structure  by  duplicating  certain  other  rings.  Figure  3-4  illustrates 
duplication  as  applied  to  the  z^ -rings  of  Figure  3-2.  In  order  to  more 
clearly  illustrate  the  effect  of  duplication  upon  the  second  ring 
associated  with  a  duplicated  head  block,  assume  *1  't  the  x^ -blocks  of 
Figure  3-2  are  duplicated  upon  the  element  blocks  of  the  rings  below 
them.  The  result  of  this  duplication  is  shown  in  Figure  3-5.  Note  that 
all  blocks  in  the  copies  of  the  ring  associated  with  a  duplicated  block 
must  maintain  membership  in  any  other  rings  of  which  blocks  of 
the  copied  ring  were  members. 

Contrary  to  stacking,  duplication  may  be  applied  to  either  the 
z^-rings  or  the  ;  2~rings  or  both  s:mulataneously. 

If  auplicaiion  is  applied  to  both  the  zi -rings  and  the  z0-rings,  the 

62  x  4 
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Secondary  Effects  of  Duplication  Transformation 


structure  of  Figure  3-6  will  result.  This  leads  us  to  our  third 
transformation,  ’’elimination".  If  the  x^-blocks  are  being  used 
purely  for  connective  purposes  (like  the  b^ -blocks  and  bg-blocks  of 
Figure  3-1),  they  can  be  iemoved,  or  eliminated,  from  the  structure 
of  Figure  3-6  since  in  this  structure  there  is  a  one-to-one  corre¬ 
spondence  between  the  -blocks  and  the  x^ -blocks  which  they 
connect.  The  resultant  structure  would  appear  as  in  Figure  3-7. 

Finally,  we  can  apply  stacking  to  the  z^- rings  of  Figure  3-2  and 
duplication  tc  the  Z2-rings  or  vice-versa  which,  for  the  first  case, 
will  yield  the  structure  of  Figure  3-8. 

It  should  be  apparent  that  stacking,  duplication,  and  elimination 
can  be  applied  in  various  combinations  simultaneously  to  the  several 
rings  of  the  ISSM.  In  order  to  specify  mere  rigorously  just  what 
combinations  of  these  transformations  are  permissable,  we  will  intro¬ 
duce  a  number  of  decision  variables  which  pertain  to  the  various  trans¬ 
formations  for  each  of  the  types  of  rings  within  the  ISSM.  Before  doing 
this,  however,  we  will  consider  a  number  of  peripheral  issues. 

3.3  Additional  Model  Characteristics 

At  first  glance  it  might  appear  desirable  to  treat  each  ring  of 
the  ISSM  independently  from  all  the  others  with  regard  to  the 
transformations  which  we  apply  to  it.  For  instance,  we  may  wish 
to  maintain  one  a^  -~ing  of  the  ISSM  as  a  ring,  apply  the  stacking 
transformation  to  another  a^-ring,  and  apply  the  duplication 
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transformation  to  still  another  a^-ring.  It  quickly  becomes  apparent^ 
however,  that  such  a  posture  has  two  pitfalls.  First,  it  is  doubtful 
that  such  a  structure  can  be  used  in  a  very  efficient  manner.  At  each 
step  in  the  performance  of  an  operation  upon  the  structure  one  must 
determine  the  convention  applying  to  that  particular  portion  of  the 
structure  and  then  choose  the  proper  code  to  effect  the  desired  step. 
Second,  the  number  of  variables  required  to  specify  such  a  structure 
via  any  given  model  would  quickly  exceed  our  ability  to  consider 
all  possible  variations  thereof. 

Therefore,  we  will  make  the  assumption  that  the  application  of 
a  transformation  to  any  ring  of  a  given  type  implies  the  application  of 
that  transformation  t  all  rings  of  the  given  type.  Thus,  all  rings 
(and  all  blocks)  of  a  given  type  are  assumed  to  be  treated  uniformly 
and  our  model  is  said  to  be  homogeneous. 

As  a  second  issue,  notice  that  applying  duplication  to  the  a„- rings 
of  the  ISSM  or  to  the  ag-rings  or  to  both  will  in  general  result  in  the 
generation  of  several  copies  of  each  bg-block.  (See  Figure  3-1.) 
Clearly,  this  results  in  an  increase  in  the  number  of  such  blocks  in 
each  relation  ring.  In  particular,  this  results  in  the  generation 
of  two  or  more  bg-blocks  to  represent  a  single  relation  symbol  as 
it  appears  in  the  DSM  of  Figure  2-1.  As  we  shall  see  later,  certain 
operations  which  we  may  want  to  perform  upon  the  data  represented 
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by  a  particular  storage  structure  may  use  a  given  relation  ring 
to  search  for  the  occurrence  of  a  particular  relation  symbol  (as 
used  in  the  DSM  and  as  characterized  by  its  association  with 
one  or  more  relation  instances).  If  we  can  guarantee  that  the 
relation  ring  contains  no  more  than  one  bg-block  representing 
the  relation  symbol  sought,  then  once  a  bc-block  repre- 

w 

senting  the  desired  relation  symbol  has  been  found,  the 

remainder  of  the  b--blocks  in  the  relation  ring  may  be 

o 

ignored.  On  the  other  hand ,  if  the  relation  ring  may 
contain  more  than  one  bg-block  representing  the  relation 
symbol  sought,  then  even  when  a  bg-block  representing 
the  desired  relation  symbol  has  been  found,  the  remainder 
of  the  bg-blocks  in  the  relation  ring  must  be  examined  to 
check  for  additional  blocks  representing  the  relation 
symbol. 

It  may,  therefore,  be  advantageous  for  the  relation 
rings  to  maintain  their  original  compositions  (and,  hence, 
guarantee  that  there  exists  no  more  than  one  bg-block 
representing  a  given  relation  symbol)  regardless  of  trans¬ 
formations  applied  to  the  rest  of  the  ISSM.  For  this 
reason  we  will  now  replace  each  bg-block  by  the  struc¬ 
ture  of  Figure  3-9.  We  assume,  of  course,  that  the  rings 
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therein  n.  \y  (like  the  rings  of  the  ISSM)  contain  reverse  and  head 
pointers. 

The  bg- block  and  the  b^-block  of  Figure  3-9  (each  of  which 
contains  afield  for  the  name  of  a  relation  symbol)  replace  the  bg-block 
in  the  a^-ring  and  in  the  a^-ring  of  the  ISSM,  respectively.  In  the 
ISSM  upon  which  no  transf  or  nations  have  been  made  there  will  then 
be  one  bg-block  and  one  b^-biock  fo?  each  (new)  bg-block.  Applying 
duplication  to  the  a^- rings  and  to  the  ag- rings  now  results  in  tl  5 
generation  of  copies  of  b^-blocks  and  b^-biocks,  respectively. 

Each  copy  of  a  bg-block  which  corresponds  to  a  particular  bg- block 
is  put  into  he  ag-ring  a  sociated  with  that  bg-block.  Similarly, 
each  copy  of  i  b^-block  is  put  into  the  associated  ag-ring. 

We  see  nr  w  that  the  b„- block  has  assumed  a  new  role  in  our 

U 

storage  struc  ture  model  -  it  now  functions  as  an  head  (of  the  ag-ring 
and  the  ag-ring)  instead  of  as  an  element  block.  Our  previous 
discussions  of  stacking,  duplication,  and  elimination  still  apply, 
however,  since  the  repetitive  and  symmetric  nature  of  the  structure 
remains  unchanged. 

Substituting  the  structure  of  Figure  3-9  for  each  bg-block  in 
the  ISSM  of  Figure  3-1  yields  the  structure  of  Figure  3-10,  which 
we  will  henceforth  call  our  Storage  Structure  Modecor  simply  SSM. 

By  making  this  substitution  we  have  clearly  made  no  changes 
which  would  affect  the  ability  of  the  structure  to  represent  the 
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Figure  3-10.  Storage  Structure  Model 


intrinsic  structure  of  a  collection  of  data  items  as  modeled  by  the 
DSM, 

A  third  issue  to  consider  also  involves  the  effects  of  duplication. 


Whenever  duplication  is  applied  to  the  ag-rings  or  the  ag-rings  of 
the  SSM,  copies  of  b^-blocks  and  b^ -blocks,  respectively,  are 
generated.  Since  certain  operations  which  we  may  want  to  perform 
upon  the  data  represented  by  a  particular  storage  structure  may 
require  the  ability  to  access  all  copies  of  a  given  b^-block  or  b^- 
block  as  they  represent  a  given  source  or  target,  respectively,  we 
will  introduce  for  each  b^-block  and  each  b^-block  of  the  SSM  a  ring 
to  contain  all  copies  of  the  block  generated  by  duplication.  These 
rings  (which  are  similar  to  the  relation  rings  for  bg -blocks)  will  be 
called  source  rings  for  b^-blocks  and  target  rings  for  b^-blocks. 

As  a  final  issue  we  introduce  the  possibility  cf  a  block  type 

field  for  each  block  of  the  SSM.  In  certain  cases,  especially  when 
stacking  and  duplication  have  been  applied  to  several  pairs  of  adjacent 
rings,  it  may  be  advantageous,  if  not  imperative,  to  have  some  means 
for  distinguishing  the  different  types  of  blocks  from  one  another.  To 
accomplish  this,  we  may  include  within  each  block  a  type  field  which 
contains  some  code  identifying  he  type  of  the  block. 

Since  the  SSM  contains  only  eleven  distinct  types  of  blocks 


(b-pbg,** '  a  tyPe  iield  nee(*  contain  at  most  four  bits.  If 


fewer  than  eleven  types  are  to  be  distinguished  (due  to  the  elimination 


of  certain  blocks  from  the  SSM,  for  instance),  an  even  smaller 
type  field  may  be  used.  If  s^rr-e  other  item  in  a  given  type  of 
block  does  not  use  the  entire  field  allocated  to  it  (as  is  frequently 
true  with  pointer  fields) ,  it  may  be  possible  to  put  the  type  code  in 
that  unused  space,  thus,  eliminating  the  need  for  a  separate  type 
field.  In  any  event  we  will  provide  the  option  of  including  a  type 
field  in  the  blocks  of  any  given  type. 

3.4  Decision  Variables 

Return  now  to  consideration  of  the  decision  variables  which  we 
will  use  to  describe  the  SSM  and  transformations  applied  thereto. 

Let  be  a  binary-valued  decision  variable,  the  value  of  which 
indicates  whether  (0.=1)  or  not  (0,,=Q)  forward  pointers  are  present 
in  the  airings  of  the  SSM,  where  ie  {1, 2,  •  •  • ,  10} .  0.=O  implies 

that  either  the  stacking  or  the  duplication  transformation  has  been 
applied  to  the  a^ -rings  and  that  they  are  no  longer  really  rings  at  all. 

Let  Aw  be  a  binary- valued  decision  variable,  the  value  of  which 
indicates  whether  the  stacking  transformation  (A^O)  or  the  dupli¬ 
cation  transformation  (A^=l)  has  been  applied  tc  the  a^-rings  of  the 
SSM  when  0^=0,  where  ie  {1, 2,  •  •  • ,  10} .  As  a  matter  of  convenience, 
we  will  require  ApO  when  0.  =1 . 

Let  0j  be  a  binary- valued  decision  variable,  the  value  of  which 
indicates  whether  (0j-l)  or  not  (01=0)  head  pointers  are  present  in 
the  a. -rings  of  the  SSM,  where  ie{l,2,  •••  ,10}.  Note  that  head 


76 


pointers  may  De  present  even  though  the  a^- rings  are  not  actually 
rings.  For  instance,  head  pointers  may  be  used  to  advantage  when  the 
element  blocks  of  a  ring  are  stacked  upon  the  head. 

Let  0!f  be  a  binary -valued  deciscion  variable,  the  value  of  which 
indicates  whether  (07  =1)  or  not  (07  =  0)  reverse  pointers  are  present 
in  the  airings  of  the  SSM,  where  ie  {1, 2,  •  •  • ,  10}.  We  will  assume 
that  07  can  equal  1  only  if  0.-1.  That  is,  we  will  assume  that  reverse 
pointers  are  present  in  the  airings  only  if  forward  pointers  are. 

What  this  really  means  is  that  if  the  a.-rings  are  linked  via  pointers 
in  one  direction  only,  then  these  pointers  are  forward  pointers; 
reverse  pointers  are  present  only  to  provide  two-way  linking. 

Let  (k  be  a  binary  -valued  deciscion  variable,  the  value  of  which 
indicates  whether  (<3.=1)  or  not  (^=0)  the  b.-blocks  of  the  SSM  are  to 
be  eliminated  from  the  structure,  where  ie{2,3,*  •  •  ,10}.  (The  bj-blocks 
and  the  b^-blocks  are  assumed  always  to  be  present.)  From  our 
discussion  of  elimination  we  know  that  the  value  of  ^  is  determined  by 
the  values  of  A.  ^  and  Ai  where  ie { 2, 3,  •  •  • ,  1 0} .  In  particular,  if 
Af  j  =  A.  =  1,  then  jS.  =  1;  otherwise  (3p0.  In  other  words,  the 
^-blocks  are  eliminated  from  the  SSM  if  and  only  if  duplication  has 
been  applied  to  the  a^  ^-riags  ana  to  the  airings,  where  ie  {2,3,  •  *  •  ,10}. 

Finally,  let  r.  bo  a  binary-valued  decision  variable,  the  value 
of  which  indicates  whether  (r.-l)  or  not  (is=0)  the  b^-blocks  of  the 
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SSM  contain  a  type  field,  where  ie  {1 , 2,  •  •  • ,  11}. 

There  are  a  number  of  constraints  (some  of  which  we  have 
already  mentioned)  which  the  above  decision  variables  must 
satisfy  in  order  to  insure  a  physically  realizable  structure.  Each 
of  these  will  oe  considered  in  turn  below. 

Constraint  1 

^  +  Ai  <  1  for  all  ie{l,  2,  *  •  •  ,10} 

This  constraint  implies  that  <p .  and  A^  may  not  both  be  equal  to 
1.  That  this  should  be  trie  is  obvious:  <^  =  1  implies  explicit 
a.-rinfs  of  (forward)  pointers  and  a. -*1  implies  the  application 
of  duplication  to  the  rings,  clearly  an  impossible  situation. 

Constraint  2 

0V  <  0.  for  all  ie{l,2,  ‘  ’  ,10} 

Thi"  constraint  implies  that  the  a^-rings  may  not  contain 
reverse  pointers  unless  they  contain  forward  pointers. 

Constraint  3 

0!  +  Aj  <  1  for  all  ie  {1 , 2,  *  •  *  ,10} 

The  purpose  of  this  constraint  is  to  prohibit  head  pointers  fiom 
the  a^-rings  if  duplication  has  been  applied  to  them.  Clearly,  a 
head  pointer  would  be  to  no  advantage  in  a  ring  to  which  duplication 
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has  been  applied  since  the  head  is  attached  dii  jUy  to  each  element 
block. 

Constraint  4 

%  =  Ai-lAi  f°r  i€ {2, 3 ,  -  •  * .  10} 

This  constrain  implies  that  the  b. -bleaks  may  be  (in  fact,  must 
be)  eliminated  from  the  SSM  if  and  only  " .  duplication  has  been 
applied  to  the  rings  on  both  sides  of  the  Hocks. 

Constraint  5 

0i  +  0i+l+ Ai+Ai+1  —  1  f or  all  ie  {  2, 4, 6 , 8} 


The  purpose  of  this  constraint  is  to  prohibit  the  application  of 
stacking  to  both  (types  of)  rings  which  share  common  element  blocks. 

Upon  close  examination  of  the  SSM  it  becomes  clear  that  we  must 
generalize  this  constraint  somewhat  to  yield  the  following  constraint. 


k 

E  Wht-1 

i=j  1 


for  all  je  {2, 4,6,8},  ke  {3,5 , 7, 9},  and  k  >  j 


The  implication  is  that  if  each  type  of  ring  from  the  a^ -rings 
to  the  a^-rings  inclusive  is  subjected  to  either  stacking  or  dupli¬ 
cation,  then  duplication  musi  be  applied  to  either  the. a. -rings 

1 

or  the  a^-rings  (or  both).  If  tnis  constraint  is  not  met,  we  have  the 
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(impossible)  situation  analogous  to  applying  stacking  to  rings  which 
share  common  element  blocks.  (In  fact,  if  k  =  j+1,  we  have  precisely 
that  situation. ) 

Perhaps  a  brief  example  would  serve  to  clarify  this 
point.  Assume  that  j=2  and  k=5  and  that  duplication  has  been 
applied  to  both  the  a^- rings  and  the  a^- rings.  In  this 
case  each  combination  of  related  b„-blocks,  b. -blocks,  and 
b_-bloeks  is  ’fused"  into  a  single  block  which  functions  as 

D 

an  element  of  both  an  ag-ring  and  an  a^-ring.  It  follows 
that  stacking  may  not  be  applied  to  both  the  a^-rings  and 
the  a^-rings  simultaneously.  Since  we  have  assumed  that 
either  stacking  or  duplication  lias  been  applied  to  each, 
we  see  that  duplication  must  be  applied  to  either  the 
ag-rings  or  the  a^-rings  (or  both).  We  note  that  if 
stacking  were  applied  to  the  a^ -rings  instead  of  dupli¬ 
cation,  then  duplication  must  be  applied  to  the,  ag-rings. 
Similarly,  if  stacking  were  applied  to  the  a^ -rings  instead 
of  duplication,  then  duplication  must  be  applied  to  the 
a&- rings. 

There  are  a  number  of  other  decisions  wtveh  we 
may  want  to  make  in  characterizing  the  structure  via  the 
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SSM.  In  particular,  instead  of  assuming  that  each  b^-block 
and  each  b^ -block  contains  a  description  block  indicator, 
we  may  wish  to  assume  that  the  description  olocks  are 
attached  directly  to  the  b^-blocks  or  to  the  b^-blocks  or 
both.  Let  cr^  be  a  binary-valued  decision  variable,  the 
value  of  which  indicates  whether  (cr^-1)  or  not  (ct^=0)  the 
b^-blocks  of  the  SSM  contain  description  block  indicators;  if 
they  do  not,  we  assume  that  the  appropriate  description 
block  is  attached  directly  to  each  b^-block.  Let  be 
a  binary-valued  description  variable  defined  similarly  for 
the  b^ -blocks  of  the  SSM. 

When  we  defined  the  data  item  description  block,  we 
indicated  that  there  is  one  description  block  for  every 
data  item.  In  examinLig  transformations  which  may  be  applied 
to  the  SSM  we  have  seen  that  it  is  possible  for  the 
transformed  structure  to  contain  more  than  one  copy  of 
a  b^ -block  and/or  more  than  one  copy  of  b^ -block.  We 
wish  to  reaffirm  at  this  point  the  assumption  that  there 
is  indeed  exactly  one  description  block  for  each  data 
item.  This  means,  of  course,  that  if  cr^  =  i  ,  we 
must  insure  that  there  is  exactly  one  copy  of 
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each  bj-block  in  the  transformed  structure.  Similarly,  if  0, 
we  must  insure  that  there  is  exactly  one  copy  of  each  b^ -block  in 
the  transformed  structure.  We  will  discuss  the  enforcement  of 
these  two  constraints  somewhat  later  in  the  discourse. 

The  assumption  of  one  description  block  for  each  data  item 
perhaps  requires  some  justification.  Possibly,  the  most  compelling 
reason  for  making  the  assumption  is  that  the  data  item  is  the 
smallest, atomic  unit  of  information  to  be  represented  and,  hence, 
it  should  be  represented  in  the  structure  by  some  unique,  well- 
defined,  closed-form  device. 

Consider  a  simple  example.  Suppose  that  the  value  (i.e. ,  the 
description)  of  a  given  iata  item  varies  in  the  course  of  the  solution 
of  a  problem.  For  instance,  the  data  item  might  correspond  to  the 
coordinates  of  a  symbol  being  displayed  upon  a  computer-driven 
graphical  (CPT)  display.  If  the  symbol  is  being  moved  across  the 
display,  its  coordinates  will  vary  with  time  and  must  constantly  be 
updated.  Since  all  descriptions  of  such  a  data  item  must  be  altered 
each  time  its  value  changes,  a  single  description  is  to  be  greatly 
preferred. 

A  second  reason  for  making  the  assumption  is  that  when  a  data 
item  is  both  a  target  and  a  source,  the  description  block  acts  in  a 
connective  capacity,  much  as  Hie  element  blocks  which  are  shared 
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by  back-to-back  rings.  Any  two  such  rings  share  at  most  one 
element  block  between  them.  Thus,  ii  the  analogy  is  to  carry 
over  fully,  there  should  be  only  one  description  block  for  a  given 
data  item. 

The  connective  roie  of  the  description  block  brings  up  another 
point.  If  oy=0  and  cr^  =0,  it  is  clear  that  we  may  not  apply  stacking 
to  both  the  a^ -rings  aid  the  a.  ^  rings.  It  should  also  te  clear  that 
(except  in  special  cases)  whenever  A  and  n  are  not  disjoint,  there 
must  be  at  least  one  if  {1,  2,  •  •  * ,  10}  such  that  the  airings  of  the 
SSM  are  explicit  rings  (i.e. ,  0.-1).  This,  of  course,  is  to  allow 
the  structure  to  ''wrap  around’1.  Thus,  we  have  die  following  con¬ 
straints. 

Constraint  t’ 

V°s  +V,)#+VAio~I 

This  constraint  disallows  stacking  of  both  the  a. -rings  and  the 
a^-rings  when  a,  =0  and  Og-O. 

Constraint  7 

If  cr^=K>,  (Tg-O,  and  !a  i  t-  >h|>!A°i,  then 
10 

);  o.  >  i 
i=l  ' 

This  constraint  implie  ,  the  existence  ci  at  least  one  explicit 
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ring  whenever  A  and  n  are  not  disjoint.  (We  note  that  I A  1+ 1  n  |>  |  a0  I 

is  always  true.  Hence,  Ia  1+ 1  n  1=  I  a0  I  implies  A  and  n  are  disjoint. ) 

10 

We  can  relax  this  constraint  somewhat  to  allow  Y  0.=O  in  cases 

i=l  1 

for  which  A  and  n  are  not  disjoint  provided  we  can  guarantee  that 
d.  r.  d.  and  d.  r.  d.  are  not  both  true  for  any  d.  and  d.  eA° 


li  h  h 


x2  h  xi 


and  any  r.  and  r.  e  P  (d.  and  d.  are  not  necessarily  distinct,  nor 

31  32  h.  x2 

are  r.  and  r.  ). 

]1  J2 

The  next  set  of  decisions  we  may  want  to  make  involves  the 
presence  of  the  relation  symbol  name  field  in  the  b^-blocks,  the 
bg-blocks,  and  the  b^-blocks.  We  may  desire  to  exclude  this  field 
from  certain  of  these  blocks.  Therefore,  we  introduce  the  binary - 
valued  decision  variables  p^,p and  p where  p^  indicates  whether 
(p^=l)  or  not  (pj=0)  the  relation  symbol  name  field  is  present  in  the 
bg-blocks  of  the  SSM,  and  p ^  and  p^  perform  similar  functions  for 
the  bg-blocks  and  the  b^-blocks,  respectively. 

Up  to  this  point  we  have  assumed  that  elimination  of  the  b.  -blocks 
from  the  SSM  always  occurs  if  duplication  has  been  applied  to  the 
a^  j-rings  and  to  the  a^.- rings,  where  ie  { 2 , 3 ,  *  •  “  ,  lCj.  (See  Con¬ 
straint  4.)  We  wish  now  to  alter  this  assumption  somewhat  so  that 
if  the  b  -blocks  contain  relation  symbol  fields,  they  will  not  be 
eliminated  from  the  structure  under  any  conditions.  Clearly,  this 
change  applies  only  for  ie  {5,6,7}.)  Therefore,  we  replace  Con¬ 
straint  4  by  the  following  one. 


84 


Constraint  4* 


For  ie  {2, 3, 4, 8, 9,  10} 

Q.=A.  i  A. 
yl  l-l  l 

For  ie  {  5,6, 7} 

K  Pi„4  =  *>  Pi  =0 
Otherwise,  0.  =  Ai  ^A. 

This  concludes  our  discussion  of  the  decision  variables 
required  to  describe  the  SSM. 

3.5  Quantification  of  Model 

Now  that  we  have  a  formal  manner  (i.  e. ,  the  decision 

variables)  for  describing  the  transformations  which  may  be 

applied  to  the  SSM,  we  would  like  to  determine  the  quantitative 

effects  upon  the  SSM  of  applying  various  transformations. 

13? 

Assume  that  for  each  of  the  sets  of  the  DSM  A^,  A^,  Afn, 

2  3  1 

r  ,  2.,  11.  n  and  n  (for  all  possible  values  of  their 
respective  indices)  we  are  given  the  expected  number  of  elements 
therein  (i.e. ,  the  average  cardinality  of  the  set).  The  expected 
number  of  elements  contained  in  some  set  A*,  where  fe{l,2,*..,i 
can,  for  example,  be  determined  from 
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This  means  that  we  know  the  expected  number  of  element 
blocks  in  each  type  of  ring  of  the  SSM  before  any  transformations 
have  been  applied.  In  particular,  let  k9  represent  the  expected 
number  of  element  blocks  in  each  of  the  a^-rings  of  the  SSM  (before 
the  application  of  transformations),  where  ie  {1 , 2,  •  •  • ,  10} .  ko  and 
k°  will,  of  course,  both  be  1. 

Consider  now  the  effect  of  transformation  of  the  SSM  upon  the 
expected  number  of  element  blocks  in  each  of  the  various  ring  types. 
Let  k.  represent  the  expected  number  of  element  blocks  in  each  of 
the  airings  of  the  SSM  after  the  application  of  transformations, 
where  i  e{l,  2,  •  •  • ,  10}.  Clearly,  the  application  of  stacking  to  the 
a^-rings  has  no  effect  upon  k.  -  it  remains  equal  to  k9.  Furthermore, 
we  wiil  assume  that  elimination  has  no  effect  upon  k^.  Even  though 
the  element  blocks  of  the  a.-rings  may  be  physically  absent,  the 
function  Which  they  perform  's  not  altered  by  this  fact.  We  assume 
that  kj  represents  in  this  case  die  exacted  number  of  ’’virtual” 
element  blocks.  It  follows  that  is  affected  only  by  duplication. 

(We  know  from  our  earlier  discussions  that  k^  will  indeed  be  affected 
by  duplication.) 

For  notational  convenience,  if  >.  is  a  binary-valued  decision 
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variable,  we  define  x  as  having  a  value  which  is  the  complement  of  the 
the  value  of  x.  That  is,  if  x=0,  then  x=l,  and  if  x-1,  then  x=Q. 

We  may  then  write  expressions  governing  the  various  k.  for 
ie  {1 , 2,  *  •  * ,  10}  as  follows: 


k1=A1+A1k| 


k2=A2+^2k2^4+A4K4^6+A6k6^8+A8k8^10+A10k10^^ 
k3=A3+A3k3  ^i+Aik 

k4=A4+A4k  4  ^6+A6k  b  ^8+AGk  8^A10+A10k  10^^ 
k5=A5+A5k5  ^3+A3k3  ^i+Aik<i)' 

k6=^  6k  6  ^8+A8k< 8^10+Ai0k 10^ 

k7=A74A7k7(A5+A5k°5(A3+A3k03(Al+Alkl))) 


k8  A8+A8k8^A10+A10k10' 


k9=A9+A9k°9  f A5k°5{A3+A3k3(Al +Alk  1)))} 


k10=A10+A10k10 


We  can  easily  verify  the  validity  of  these  expressions.  Consider 
first  kj.  If  Aj-0,  each  a^-ring  of  the  SSM  will  be  either  an  explicit 
rmg  (0^=1)  of  k^  bj-blocks  or  a  stack  (0^=0)  of  k^  b^-blocks  upon 
a  bg-block.  (k°  is,  of  course,  the  expected  -  not  actual  -  number 
of  blocks  in  each  of  these  rings  or  stacks.)  If  on  the  other  hand 
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A  =1,  each  a^-ring  of  the  transformed  SSM  will  consist  simply  of  a 
-block  and  a  (copy  of  a)  bg-block,  concatenated.  Clearly, 
applying  the  duplication  transformation-  or  not  applying  it  -  to 
any  other  type  cf  ring  in  the  SSM  can  have  no  effect  upon  the 
number  of  elements  in  the  a^-rings.  (Applying  duplication  to 
the  ag-i'ings,  for  instance,  will  affec;  the  number  of  a^-rings  but 
not  the  number  of  elements  contained  in  each.)  The  expression 
for  follows  directly. 

Consider  next  k^.  Assume  for  the  moment  that  Aj=0.  Then  k^ 

will  oehave  exactly  as  k^:  if  A3=0,  each  airing  of  the  SSM  will  be 

either  an  explicit  ring  (03~i)  of  k°  b3-blocks  or  a  stack  (03=O)  of 

k0  bg-blocks  upon  a  b^-block;  and  if  a3=1,  each  airing  will  consist 

of  a  b3 -block  concatenated  with  a  b^-block.  On  the  other  hand  if 

Aj=l ,  each  b, -block  will  be  replaced  by  k°b3 -blocks,  one  for  each 

time  the  bg-block  of  an  a^-ring  is  duplicated  upon  a  bj -block.  In 

this  case,  if  Ao=0,  each  a0-ring  or  stack  will  contain  k°k°  b0-blocks. 
<1  10  0 

If  A3=l,  of  course,  each  a^-ring  will  still  consist  of  a  single  bg- 
block  concatenated  with  a  b4-block.  Again,  applying  the  duplication 

transformation  to  any  rings  other  than  the  a, -rings  or  the  a0-riup's 

1  o  ° 

cari  have  no  effect  upon  the  number  of  elements  in  the  a„ -rings, 

o 

The  expression  for  k^  follows  directly. 
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Using  arguments  similar  to  those  above,  we  can  easily  justify 
the  expressions  given  for  the  remaining  k  . 

In  addition  to  the  number  of  element  blocks  in  each  airing 
(i.e. ,  the  number  of  element  blocks  associated  directly  with  the 
head  of  each  a. -ring),  we  may  desire  to  know  the  number  of  element 
blocks  not  actually  in  each  airing  but  associated  indirectly  with  the 
head  of  that  ring.  For  instance,  we  may  want  to  know  the  number  of 
bg-blocks  associated  with  a  particular  bg-block.  Since  there  are 
k2  a^- rings  associated  with  each  bg-bloch  (from  the  fact  that  the 
ag-ring  of  wnich  the  b2~block  is  head  contains  k2  bg -blocks  and  there 
is  one  a^-ring  associated  with  each  of  these  b3~blocks)  and  since 
there  are  k^  b^-blocks  in  each  a^-ring,  we  determine  that  there 
are  kgk^  b^-blocks  associated  (indirectly)  with  each  bg-block. 

For  the  moment  let  the  notation  b^b^  designate  the  number  of 
^-blocks  associated  (either  directly  or  indirectly)  with  each  b^-block. 
Let  us  then  define  a  5  by  6  matrix  K  which  has  as  its  elements  those 
quantities  indicated  by  Table  3-1.  Using  arguments  similar  to  that 
used  above  to  determine  b^/b2= 

the  remaining  elements  of  K.  These  values  are  given  by  Table 
3-2.  We  will  denote  the  element  in  the  j-th  column  of  the  i-th  row 
of  K  by  Khj  where  ie  {1,2,**°  ,5}  and  je  {1, 2,  •  • » ,6}. 

It  might  be  well  at  this  point  to  consider  a  number  of  other 

89 


^2^4’  We  may  determine  values  for 


1 

2 

3 

4 

5 

6 

bl/b2 

b3A2 

b5/b2 

V°2 

V^S 

;blA 

Vb4 

b3/b4 

bA 

b/b4 

b9/b4 

r 

biA 

bA 

Vb8 

“5% 

bA 

bn/b6 

Vb8 

bg/bg 

b/b8 

bg/bg 

b,A 

bl/b10 

b3/bl( 

W 

_ _ 

bAo 

bg^io 

bnAo 

Table  3-1.  ,  Definition  of  Elements  of  Matrix  K. 
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-ry-t'vv 


quantifies  which  are  similar  in  many  respects  to  those  just  consid¬ 
ered  and  which  will  prove  useful  somewhat  later. 

Let  m  represent  the  number  of  copies  of  each  b1  -block  gener- 
ated  by  transformation  of  the  SSM.  Before  any  transformations  are 
applied  to  the  SSM,  ma=l,  of  course.  Using  an  argument  similar  to 
that  used  in  obtaining  expressions  for  the  various  k..,  we  obtain  the 
foil  >wing  expression  for  m  . 

cl 

ma=^2+A2k2^4+A4k4^6+A6k 6  ^A8+A8k<8^A10+A10k  10^^ 

Similarly,  let  m  represent  the  number  of  copies  of  each 

r 

bfi-DlocK  generated  by  transformation  of  tne  SSM. 

mp=  ^^^9^7  4  A7k  7^A5+A5k  5^3  f  A3k  g^+A^k  ^))j) 

Let  m  represent  the  number  of  copies  of  a  given  b,.-block 
associated  with  the  (copies  of  the)  b^ -block  representing  a  given 
source. 

mr1"  A6+A8k6^A8+A8k8^A10+A10k10^ 

Finally,  let  m  represent  the  number  of  copies  of  a  given 
2 

b^-block  associated  with  the  (copies  of  the)  b^ -block  representing 
a  given  target. 
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mr  =  V*5k°5 

Ct 


We  indicated  earlier  that  whencr^=0,  we  must  insure  that  there 
is  no  more  than  one  copy  of  each  -block  in  the  SSM,  and  when 
02=0,  we  must  insure  that  there  is  no  more  than  one  copy  of  each 
bjj -block.  We  now  have  the  means  to  enforce  these  two  constraints. 
If  0^=0,  we  must  restrict  Ag.A^,  Ag,Ag,  and  AjQ  to  values  which 
guarantee  m  =1.  Similarly,  if  oo=0,  we  must  restrict  A. ,  AQ,  AK, 

3  ci  1  o  D 

A, and  Ag  to  values  which  guarantee  m  =1.  In  particular, 

o,  m  <  1  and  j.  m  <  1. 
la—  2  p  — 

Our  discussions  so  far  have  been  concerned  mainly  with  the 
number  of  blocks  of  or  ,  type  which  are  associated  with  a  given  block 
of  some  other  type.  We  will,  however,  also  have  need  for  the  total 
number  of  blocks  of  a  given  type. 


Let  rm  represent  the  (average)  total  number  of  b^- blocks  in  the  SSM. 
where  ie-fl,  2,*  * c  ,11}. 

Tnere  are  k°  sources  and  m,  b.  -blocks  representing  each 
3.  1 

source  in  the  SSM.  Tneretore,  m,  =ra  k°.  Since  there  are  k, 

-blocks  associated  with  each  bg-block,  we  know  that  m2=m^/kj. 
Associated  with  each  bg- block  are  kg  bg-blocks.  Hence,  m^kgmg. 
Clearly,  m^rr^/kg*  and  so  on.  Carrying  out  all  indicated 
multiplications  and  divisions  will  result  in  the  expressions  given 


in  Table;  3-3. 


S3 


m.  =  m  k° 
1  a  a 


m. 


m„= 


m  k° 
a  a 


2  k. 


„  k2maka 

m3  =  k2“2  =  -X— 


m  k  m  k° 
m.=^  _  2  a  a 


4  "k. 


1Tk3 


“S  =  k4m4  =  -O 


k2k4mak°a 


r  3 


m( 

mc  =~ 
6  k„ 


k„k  m  k° 
2  4a  a 

~*lk3k5 


kjk.k-m  k° 

no,  =  k„m.  = 444-JL1 
7  66  kjkgkg 

k2k4k6maka 
klk3k5k7 


m7 

m8=I^- 


k  k.k.kQm  k° 
m„=k.m.  =  2  4  6  8  ^ 


8  8 


klk3k5k7 


m. 


m 


10  '  k. 


mll=k10m10 


k2k4k6k8maka 

klk3k5k7k9 

k2k4k6k8k10maka 

klk3k5k7k9 


Table  3-3.  Numbers  of  Blocks,  by  Type,  in  the  SSM 
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We  also  know,  of  course,  that  mit=m  ko. 

*  -  11  p  p 


Hence,  the  following 


equation  must  always  hold. 


klk3k5k7k9  mpkp  "  k2k4k6k8k10  mak  a 


In  particular,  it  lust  hold  for  the  SSM  before  the  application  of  any 
transformations: 


k0lk03k5k07k9kp 


Recalling  that  k°  =  1  and  kr^  =  1 ,  we  finally  get  the  equation 


ifOb-0 1^1,0  i,0  _  IfO  1.0 1-0 1.0  k0 

1  3T9  p  "  2  4  8  10  a 


We  will  later  find  this  relationship  to  oe  of  some  use  to  us. 
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Let  us  derive  another  expression  which  is,  at  the  least,  of 
some  academic  interest.  Assume  that  no  transformations  have 


been  applied  to  the  SSM.  The  number  of  -blocks  associated 

with  a  given  b^ -block  is  easily  determined  to  be  k°k°  .  This 

number,  however,  also  corresponds  to  the  number  of  relation 

symbols  associated  with  a  given  source  (i.e. ,  IP^  I  where  d.e  A 

is  the  data  item  represented  by  the  given  b^ -block).  The  number 

of  b^ -blocks  associated  with  a  given  bg-block  is  k°k°g  (recalling 

that  k°  =  l),  and  this  number  corresponds  to  the  number  of  targets 

associated  with  a  given  source/relation  symbol  pair.  Therefore, 

the  number  of  relation  symbol/target  pairs  associated  with  a 

given  source  must  be  k°k°k°k°g.  We  know  in  addition  that  A 

contains  k°  sources.  It  follows  then  that  the  total  number  of  relation 
a 

instances  (d^  r.  p^)  to  be  ^c-prt rented  by  the  SSM  (and  described 
by  the  DSM)  is  kok°k gk^gk^.  Using  the  result  obtained  above, 
we  may  say  that  the  total  number  of  relation  instances  is  also 
given  by  k°k°k°kOk° . 

Continuing  along  these  lines,  we  know  that  the  number  of 
targets  associated  with  a  given  source/relation  symbol  pair  cannot 
exceed  the  number  of  targets  in  IT.  Therefore, 


k°k°  <  k° 
k3k10  -  p 
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Simi’aily,  the  number  of  sources  associated  with  a  given  relation 
symbol/target  set  pair  cannot  exceed  the  number  of  sources  in  A. 
Tueref^p 


o 

1"3  - 


k°k~  < 


Also,  the  number  of  relation  symbols  associated  with  a  given  source 
cannot  exceed  the  number  of  relations  in  P .  As  a  result, 


^4  i  k  r 

The  same  thing  is  true  with  regard  to  the  number  of  (distinct) 

relation  symbols  associated  with  a  given  target.  Recall,  however, 

that  a  given  relation  ,,ymbol  may  be  associated  with  the  same  target 

more  than  once.  If  we  assume  that  the  expected  number  times  the 

same  relation  symbol  is  associated  with  a  given  target  is  represented 

by  m  ,  then  we  can  say 

rp 

k<ik09 

_  JJL  <  ko 

m  -  r 

r 

P 

4 

Since  each  relation  in  Pmust  appear  in  the  SSM  at  least  once,  the 

number  of  b„ -blocks  must  be  at  least  as  great  as  IP  I.  Hence, 
b 


k2k°4k°a 

K°lk°3 


>  k° 

-  r 
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This  last  result  suggests  another.  Let  mr  represent  the  expected 

number  of  b.-blocks  which  contain  the  same  relation  symbol.  Since 

the  SSM  contains  m.  b„ -blocks  and  since  there  are  k°  distinct  relation 

6  6  r 

symbols,  it  follows  that 


k2k4mak°a 

klk3k5kr 


if  no  transformations  have  been  applied  to  theSSM. 

3.6  Flexibility  of  Model 

The  reader  has  undoubtedly  noticed  that  despite  our  stated  desire 
tc  design  a  storage  structure  mod^l  capable  of  assuming  the  form  of 
any  of  the  three  basic  storage  organizations,  we  have,  made  no  men¬ 
tion  of  the  random  organization.  Be  assured,  we  have  not  forgotten. 

Our  reason  for  ignoring  the  random  organization  to  this  point 
is  simply  that  the  random  organization  is  basically  the  same  as  the 
list  organization,  the  principal  difference  being  that  in  a  list  organi¬ 
zation  a  given  item  contains  an  explicit  pointer  to  another  item,  where¬ 
as  in  a  random  organization  the  given  item  contains  a  key  which  is  used 
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to  determine  in  some  way  a  pointer  to  the  other  item;  The  result  is 
that  in  getting  from  one  item  to  another  the  random  organization  will 
generally  be  slower  than  (or  at  least  different  in  time  from)  the  list 
organization  (since  one  must  look  in  a  dictionary,  do  some  calcula¬ 
tion,  etc,  in  order  to  determine  the  pointer).  Later  we  will  see 
that  this  difference  can  easily  be  reflected  by  assuming  different 
amounts  of  time  for  following  a  list  pointer  and  for  following  a 
random  '‘pointer".  Since  in  general  a  key  is  required  for  each 
random  "pointer",  the  number  of  fields  (although  possibly  different 
in  size)  will  be  the  same  as  for  a  compilable  list  organization. 
However,  in  comparing  the  storage  requirements  of  the  two  organi¬ 
zations  one  must  be  careful  to  include  the  r..orage  required  of  a 
dictionary,  if  that  particular  method  of  implementation  for  the  ran¬ 
dom  organization  is  uced,  since  it  will  not  be  included  explicitly  in 
our  model. 

Thus,  our  model  does  indeed  encompass  the  random  organi¬ 
zation  without  extension  or  modification. 

Let  us  examine  now  the  flexibility  of  our  model.  At  the  very 
least  we  would  like  some  assurance  that  the  SSM  is  capable  of 
representing  such  basic  structures  as  a  simple  stack  and  a  simple 
linear  list. 

Presupposing  a  structure  such  as  a  stack  or  linear  list  assumes, 


of  course,  that  the  intrinsic  structure  of  the  data  to  be  represented 

is  linear  in  nature.  This  means  that  each  relation  to  be  represented 
is  one-to-one.  That  is,  a  given  data  item  is  related  to  exactly  one 
other  data  item  by  some  given  relation  and  vice  versa.  If  this  is  ,true, 
it  follows  that  k?  =  1  for  all  ie{l ,  2,  *  *  * ,  10} .  The  SSM  will  then  appear 
as  in  Figure  3-11. 

Suppose  that  we  assign  tha  following  values  to  our  various 
decision  variables: 

0.  =  0!  =  0V  =  0  for  all  ie  {1, 2,  •  •  • ,  10} 

Ai  =  l  for  all  ie  {1,2,*“  ,10} 

t.=0  for  all  i€  {1, 2,  *  •  •  ,11} 

aj  =  0,  a2  =  1 

pi=p3  =  °,  p2  =  i 

It  follows  from  Constraint  4'  that  ,3g“0  and  0.  =  1  for  all  ie{2,3,*  •  •  10} 
and  i  /  6.  The  SSM  will  now  appear  as  in  Figure  3-12,  where  the 
double -walled  field  of  each  block  corresponds  to  a  description  block. 

If  there  is  no  occasion  to  access  a  relation  instance  via  the 
relation  symbol,  v;e  may  choose  to  eliminate  the  relation  rings  from 
the  structure.  (We  assume  that  the  relation  rings  are  always  present, 
but  if  our  analysis  should  show  that  they  are  never  used,  >ve  may 
obviously  eliminate  them  from  the  structure.)  We  may  make 
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Figure  3-12.  Transformed  Storage  Structure  Model  -  1 


102 


similar  decisions  about  the  source  and  target  ring;!  (not  shown  in 
the  figure).  Furthermore,  if  P  contains  but  a  single  relation  or  if 
we  are  not  interested  in  the  individal  relation  symbols  per  se,  we 
may  set  =  0  and  eliminate  the  relation  symbol  name  field  from 
each  block.  Assuming  that  we  choose  all  of  these  options,  the  SSM 
will  appear  as  in  Figure  3- 13  (where  we  have  assumed  for  illustration 
that  A  and  II  are  not  disjoint  and  that  any  two  data  items  are  related 
by  at  most  one  relation) . 

Clearly,  the  structure  shown  in  Figure  3-13  is  a  simple  linear 
list.  By  setting  =  0  it  can  be  transformed  into  a  simple  stack. 
Thus,  we  have  satisfied  our  first  requirement,  being  capable  of 
representing  the  very  basic  storage  structures. 

As  another  example  to  illustrate  the  flexibility  of  the  SSM  and 
its  ability  to  represent  in  varying  degrees  of  detail  the  structure  of 
the  DSM,  suppose  we  are  given  some  collection  of  data  items  for 
which  the  untransformed  SGM  appears  exactly  as  shown  in  Figure  3-10 
(i.e. ,  with  all  dotted  lines  made  solid).  Furthermore,  suppose  that 
the  only  information  which  we  wish  to  "share”  is  that  concerning 
target  sets.  That  is,  we  want  to  represent  explicitly  only  the  com¬ 
position  of  each  target  set  and  tne  source/relation  symbol  pairs 
associated  with  that  target  set.  Clearly,  all  other  "shared"  items 
will  be  implicit  in  the  resultant  structure  but  will  require 
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J 


Figure  3-13.  Transformed  Storage  Structure  Model  -  2 
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interrogation  of  the  structure  to  determine. 

Let  us  assign  the  following  values  to  our  various  decision 
variables: 


h  =  02  *  *3  =  04  *  05  *  06  =  °>  07  =  08  =1-  %  =  01O  =  0 

0!  =  07  =  0  for  all  ie{l,2,-  •  •  ,10} 

A1  =  A2=A3  =  A4  =  A5  =  A6=1,  A7  =  A8  =  °’  A9=A10  =  ' 

^  =  0  for  all  ie{l, 2, •  * '  ,11} 

ox  =  0,  a2  =  1 

»\  =P3=0'  P2  =  1 

By  Constraint  4’  02  =  =  %  =  1,  P6  =  =  Pg  =  Pg  *  °»  P10  = 

The  SSM  will  then  appear  as  in  Figure  3-14.  Each  ag-rmg  in  this 

structure  represents  a  target  set  and  each  a^-rin,;  represents  the 
set  of  all  source/relation  symbol  pairs  which  are  associated  with 
a  given  target  set. 

It  is  evident  that  using  the  SSM  we  can  represent  as  much  or  as 
little  of  the  detail  of  the  DSM  as  we  wish. 

As  a  somewhat  different  example,  let  us  consider  how  the 
software-simulated  associative  memory  lesigned  by  Feldman  (and 
mentioned  in  Chapter  I)  could  be  specified  within  the  context  of  the 
SSM. 
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Storage  Stru 


In  Feldman’s  implementation  each  cell  of  the  "memory”  contains 
five  fields  as  shown  in  Figure  3-15,  three  information  fields  Fp  Fg, 
and  Fg  and  two  link  fields,  and  Lg.  Recall  that  the  address  of  a 
given  cell  is  determined  by  hashing  the  contents  of  fields  F^  and  F„. 

The  field  of  a  cell  is  used  to  associate  additional  cells  (of 
the  same  form  as  that  shown  in  Figure  3-15)  with  the  cell  accessed 
by  hashing  F  and  F0  in  order  to  handle  the  multiplicity  problem 
(i.e. ,  the  situation  in  which  more  than  one  value  for  the  field  Fg  is 
associated  with  a  given  combination  of  values  for  fields  F^  and  Fg) 
and  the  overlap  problem  (i.  e. ,  the  situation  in  which  different 
Fi/F2  value  pairs  hash  to  the  same  address).  In  particular,  the 
L<i  field  contains  a  pointer  to  one  of  the  cells  in  a  ring  of  cells 
containing  the  value  combinations  of  all  triples  (F^F^Fg)  for 
which  the  values  of  F^  and  Fg  hash  to  the  given  address. 

The  Lg  field,  on  the  other  hand,  is  used  to  link  in  rings  all 
cells  having  the  same  values  for  the  field  Fg.  These  rings  make 
it  possible  to  determine  the  values  of  fields  F^  or  F2  or  both 
given  a  value  for  field  Fg. 

Suppose,  for  instance,  we  are  given  the  following  triples  of 
values  for  the  fields  Fp  Fg,  and  Fg  (in  that  order): 
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(al,bl,cl) 


(a1,b1,c2) 

^a2,b2,C3^ 

(a3,b3,cg) 

where  the  combination  of  a^  and  b^  hashes  to  the  same  address  as 
the  combination  of  a.^  and  bg.  The  resulting  structure  would  appear 
as  shown  in  Figure  3-16,  where  the  dashed  pointers  represent 
addresses  determined  by  hashing  the  indicated  pairs  of  values. 

Let  us  define  our  data  items  and  the  corresponding  data  structure 
for  this  case  as  follows: 


(1)  Let  the  set  S  consist  of  all  (ordered)  triples 

(a  ,  b.  ,  c.  )  of  values  for  the  fields  F^F,,,  and 
*1  x2  l3  1  Z 

Fg  to  be  represented  by  the  storage  structure. 

(2)  A  =  {(a  ,  b  )  |  (a.  ,  b  ,  c.  )  e  H  for  seme  c.  } 

*1  l2  1  2  l3  3 

(3)  II  ={c.  |  (a.  ,  b.  ,  c.  )  e  H  for  some  a.  and  b.  } 

V  *1  *2  *3  ll  l2 

(4)  Let  P  contain  a  single'  relation  r  from  A  to  n 
with  the  solution  set  R  such  that 


((a.  ,b  ),  c  )  e  R<£ 

An 


(a  ,b.  ,c  )  € H 

A  ft 
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Since  in  his  storage  structure  Feldman  permits  -  in  fact,  if 
conditions  warrant,  requires  -  the  duplication  of  data  item  descrip¬ 
tions,  we  shall  relax  (for  this  case)  our  earlier  assumption  that 
data  item  description  blocks  may  not  be  duplicated  * 

Let  us  assign  the  following  values  to  our  decision  variables: 


V 

:  0”  =  O 

for 

ail  ie  { 1 , 2 ,  *  • 

•,10} 

Ai 

=  1 

for 

all  ie  { 1, 2,  *  - 

• ,  10} 

T. 

1 

=  0 

for 

all  ie  { 1, 2,  *  * 

%  11} 

ai 

=  °2 

=  0 

P1 

=  P2 

~p3  =  0 

It  follows  from  ConstrL.t  4'  that  ^  =  1  for  all  ie  {2,3,  •  •  • ,  10}. 

These  decision  variable  values  result  simply  in  the  stack 
structure  described  earlier,  except  in  this  case  the  sets  A  and  II 
are  disjoint  end  the  relation  considered  is  not  one-to-one.  Since 
we  have  allowed  description  blocks  to  be  duplicated,  the  storage 
structure  consists  of  isolated  blocks  el  storage  each  of  which  con¬ 
tains  the  values  of  one  of  the  triples  in  H.  Passing  through  each 
of  these  blocks  is  a  source  ring,  a  relation  ring,  and  a  target  ring. 

* 

To  be  completely  general,  we  could  include  in  our  model  a 
decision  variable  which  indicates  whether  or  not  multiple  data 
item  descriptions  are  to  be  allowed.  Although  this  can  very  easily 
be  done,  we  choose  not  to  do  so. 
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U  we  assume  that  (1)  instead  of  a  source  ring  for  each  distinct 
source  in  A,  we  have  a  source  ring  for  each  set  of  sources  vhich 
hash  to  a  common  address,  (2)  instead  of  a  separate  head  for  each 
source  ring,  we  use  that  b^ -block  which  has  the  address  to  which 
all  sources  in  the  corresponding  set  hash,  and  (3)  the  relation  ring 
(of  which  there  is  only  one)  is  discarded,  then  the  resultant  structure 
is  identically  equal  to  Feldman's.  The  source  rings  correspond  to 
Feldman's  rings  of  links  and  the  target  rings  correspond  to  his 
rings  of  L2  links. 

We  can  easily  justify  the  two  assumptions  concerning  the  source 
rings.  First,  combining  into  one  source  ring  all  sources  which  hash 
to  a  common  address  is  reasonable  (if  not  mandatory),  since  these 
souaC°s  are  ^distinguishable  unless  a  different  method  for  deter¬ 
mining  the  location  of  the  head  of  a  source  ring  is  used.  Second, 
using  one  of  the  elements  (although  a  special  element)  of  a  source 
ring  as  head  instead  of  creating  a  special,  distinct  head  does  not 
alter  the  structure  or  utility  of  that  ring.  In  fact,  this  may  always 
be  preferable,  since  it  reduces  the  size  of  the  ring. 

As  a  final  somewhat  more  complex  example,  let  us  consider  how 

Childs'  [11]  storage  representation  of  his  so-called  Set-Theoretic 

* 

Data  Structure  (STDS)  could  be  specified  within  the  context  of  the  SSM. 

* 

Note  that  Childs'  use  of  the  term  data  structure  cri  responds  roughly 
to  our  definition  of  the  term  storage  structure. 
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Childs  defines  a  Set- Theoretic  Data  Structure  as  a  storage  repre¬ 
sentation  of  secs  and  set  operations  such  that  given  any  family  N  of  sets 
and  any  collection  S  of  set  operations,  an  STDS  is  any  storage  represen¬ 
tation  which  is  isomorphic  to  N  with  S.  In  particular,  an  STDS  -  shown 
schematically  in  Figure  3-17  -  is  composed  of  five  structurally  independ¬ 
ent  parts: 

(1)  a  collection  S  of  set  operations 

(2)  a  set  B  of  data  item  names 

(3)  a  collection  of  data  item  definitions,  one  for  each  data  item  name 

(4)  a  collection  N  of  set  names 

(5)  a  collection  of  set  representations,  one  for  each  set  name 

Sets,  the  collection  N  of  set  names,  and  the  collection  B  of  data 

item  names  are  represented  by  blocks  of  contiguous  storage  locations. 
The  address  of  a  location  in  the  block  representing  N  is  a  set  name  and 
the  content  of  a  location  in  this  block  is  the  address  of  a  block  represen¬ 
ting  the  corresponding  set.  Similarly,  the  address  of  a  location  in  the 
block  representing  B  is  a  data  item  name  and  the  content  of  a  location 
in  this  block  is  the  address  of  a  stored  description  oi  the  corresponding 
data  item.  The  blocks  which  represent  individual  sets  contain  the  names 
of  the  data  items  which  constitute  the  sets. 

In  an  effort  to  minimize  the  storage  occupied  by  the  STDS,  Childs 
defines  two  types  of  sets:  generator  sets  and  composite  sets.  Only  the 


SET  NAMES:  N  SET  REPRESENTATIONS  DATA  ITEM  NAMES:  B  DATA 

DESCRIPTIONS 


Figure  3-17.  Set-Theoretic  Data  Structure  (Childs) 


generator  sets  have  storage  representations;  he  generator  sets  are 
disjoint;  and  the  composite  sets  are  unions  of  generator  sets. 

With  these  general  characteristics  in  mind  we  can  examine  the 

STDS  in  more  detail.  The  block  of  locations  representing  the  collection 

B  of  data  item  names  is  assumed  to  have  location  b  as  the  address  of 

o 

its  head.  The  first  location  containing  a  pointer  to  a  data  item  descrip¬ 
tion  has  the  address  bQ+l,  and  the  location  containing  a  pointer  to  the 
i-th  data  item  description  has  the  address  bQ+i.  If  b  represents  the 
cardinality  of  the  set  B,  then  the  last  location  containing  a  pointer  to  a 
data  item  description  is  b  +b.  Since  all  pointers  to  data  item  descrip- 
tions  are  located  between  bQ+l  and  bo+b,  B  may  be  represented  by  the 
set  of  integers  {l,  2, . . .  ,b}.  Therefore,  any  integer  i  such  that 
l<i<b  is  the  data  item  name  for  the  pointer  to  the  i-th  data  item  des¬ 
cription.  The  pointer  t<,  the  i-th  data  item  locates  a  block  of  storage 
containing  a  description  of  the  i-th  data  item  and  a  list  of  all  generator 
set  names  (elements  of  N)  of  which  the  i-th  data  item  is  a  constituent. 

The  block  of  locations  representing  the  collection  N  of  set  names 
is  similar  to  the  block  representing  B .  with  nQ  and  n  as  the  address  of 
the  head  and  the  cardinality,  respectively.  The  contents  of  the  block 
representing  N  are  pointers,  also.  The  pointers  fall  into  two  classes 
ana  ore  distinguished  by  an  integer  n*  such  that  l<n*<n.  For  all 
l_Ci<n*,  i  is  the  name  of  a  generator  set,  and  for  all  n*<i<n,  i  is 
the  name  of  a  composite  set.  A  g  merator  set  has  a  set  representation, 
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while  a  composite  set  does  not  since  it  is  the  union  of  some  generator 
sets.  For  i>n*  the  pointer  in  nQ+i  locates  a  block  of  storage  containing 
the  names  of  generator  sets,  the  union  of  which  forms  the  corresponding 
composite  set.  For  i<n*  the  pointer  in  nQ+i  locates  a  block  of  storage 
containing  the  names  of  all  composite  sets  which  use  generator  set  i 
and  a  pointer  to  a  block  of  locations  containing  the  names  of  those  data 
items  which  form  the  set. 

Let  us  define  our  data  items  and  the  corresponding  data  structure 
for  the  STDS  as  follows: 

(1)  Let  N  be  the  collection  of  set  names. 

(2)  Let  M  be  the  collection  of  sets  themselves. 

(3)  Let  M*  be  the  collection  of  generator  set  descriptions. 

(4)  Let  B  be  the  collection  of  data  item  names. 

(5)  Let  D  be  the  collect!™  of  data  item  descriptions. 

(6)  Let  r^  be  a  one-to-one  relation  from  N  onto  M  which  assigns 
to  every  set  name  in  N  a  unique  set  in  M. 

(7)  Let  Tg  be  a  one-to-one  relation  from  M  to  M*  wnich  assigns 
to  every  generator  set  in  M  a  unique  description  in  M*. 

(8)  Let  r^  be  a  relation  from  M  to  N  which  assigns  to  every  genera¬ 
tor  set  in  M  the  names  in  N  of  those  composite  sets  of  which 

the  given  generator  set  is  a  subset. 
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(9)  Let  be  a  relation  from  M  to  N  which  assigns  to  every  com¬ 
posite  set  in  M  the  names  in  N  of  those  generator  sets  which 
are  sheets  of  the  given  composite  set. 

(10)  Let  Tj.  be  a  relation  from  M*  to  B  which  assigns  to  every  gen¬ 
erator  set  description  in  M*  the  names  in  B  of  those  data  items 
which  are  elements  of  the  given  set. 

(11)  Let  r_  be  a  one-to-one  relation  from  B  onto  D  which  assigns 

o  - 

to  every  data  item  name  in  B  a  unique  description  in  D. 

(12)  Let  r^  be  a  relation  from  D  to  N  which  assigns  to  every  data 
item  description  in  D  the  names  in  N  of  those  generator  sets 
of  which  the  given  data  item  is  an  element. 

From  these  definitions  we  see  that 
P  ={r.|j=l,2,...,7} 

A=n  =  NUMU  M*  U  B  U  D 

Let  us  assign  the  following  values  to  our  decision  variables: 


<t>.  : 

1 

=  0!  =  0.”  =  0 

l  l 

for 

all 

ie{l,  2, . 

•  •  >  I®} 

=  1 

for 

all 

ie{l,2,. 

..,10} 

=  0 

for 

all 

ie{l,2,. 

..,11} 

“°>  *2  =1 

P1 

=  p2  =  ^3  =  ® 

It  follows  from  Constraint  4'  that  0.  =  1  for  all  ie (2, 3, .  „ . ,  10} . 
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Assuming  that  source,  relation,  and  target  rings  are  not  used, 
these  decision  variable  values  result  in  the  storage  structure  of  Figure 
3-18. 

Ignoring  for  the  moment  the  fact  that  in  the  STDS  the  blocks  repre¬ 
senting  set  names  are  adjacent  to  one  another,  as  are  the  blocks  repre¬ 
senting  data  item  names,  it  is  clear  that  the  STDS  is  simply  the  storage 
structure  of  Figure  3-18.  To  elaborate  upon  this  somewhat,  let  us 
examine  the  various  blocks  of  the  STDS  as  shown  in  Figure  3-17,  start¬ 
ing  at  the  left  with  the  blocks  representing  set  names  and  working  to 
the  right  through  the  blocks  representing  generator  and  composite  sets, 
generator  set  descriptions,  data  item  names,  and  data  item  descriptions. 

Each  block  representing  a  set  name  (i.e. ,  each  location  in  the  block 
of  locations  representing  the  set  N)  is  equivalent  tc  the  structure  of 
Figure  3-18  for  m=l.  In  this  case  the  description  block  of  the  SSM  is 
null  and  the  relation  of  interest  is  r^. 

Each  block  representing  a  generator  or  composite  set  is  also  equiv¬ 
alent  to  the  structure  of  Figure  3-18.  The  description  block  of  the  SSM 
is  again  null,  but  m>l.  For  composite  sets  the  relation  of  interest  is 
r^.  On  the  other  hand,  for  generator  sets  there  are  two  relations  of 
interest,  namely,  ^  and  r^.  If  we  assume  that  the  description  block 
indicator  for  the  target  of  relation  r 2  (which  has  only  a  single  target 
since  it  is  one-to-one)  appears  first  in  the  stack  of  description  block 
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ti 


Figure  3-18.  Transformed  Storage  Structure  Model  -  4 
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indicators ,  followed  by  those  for  the  targets  of  relation  r^,  then  there 
is  no  need  of  a  relation  symbol  field  and  we  may  continue  to  assume 

The  blocks  representing  generator  set  descriptions  correspond  to 
the  structure  of  Figure  3-18  for  relation  r,.  and  m>l. 

The  situation  for  the  blocks  representing  data  item  names  is  anal¬ 
ogous  to  that  for  the  blocks  representing  set  names  and  the  relation  of 

interest  is  r_. 

b 

Finally,  each  block  representing  a  data  item  description  is  equiva¬ 
lent  to  the  structure  of  Figure  3-18  for  m>l  and  r^  as  the  relation  of 
interest. 

Let  us  consider  now  the  adjacency  of  the  blocks  representing  set 
names  and  that  of  the  blocks  representing  data  item  names.  Since  the 
collection  of  set  names  and  the  collection  of  data  item  names  act  as 
the  (only)  entry  points  into  the  STDS,  we  will  include  (contrary  tc  our 
earlier  assumption)  source  rings  for  the  set  names  and  the  data  item 
names.  Because  there  is  no  duplication  of  these  names  within  the  STDS, 
the  source  rings  need  not  be  closed,  however.  In  particular,  a  source 
ring  need  contain  only  a  pointer  from  the  head  of  that  "ring"  to  the 
b^-block  representing  the  corresponding  source,  and  the  b^-block 
need  not  contain  a  pointer  back  to  the  head.  Moreover,  by  stacking  all 
blocks  representing  set  names  and  by  doing  the  same  for  all  blocks 
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representing  data  item  names  and  then  by  assuming  (as  Childs  does) 
that  set  names  are  integers  between  1  and  n  and  that  data  item  names 
are  integers  between  1.  and  b,  we  may  implement  the  source  ring 
’’pointers”  via  a  special  case  of  the  calculation  method  of  the  random 
data  organization  described  in  Chapter  I.  Given  a  set  name  (an  integer) 
or  a  data  item  name  (an  integer),  we  can  determine  the  (unique)  address 
of  the  corresponding  block  simply  by  adding  nQ  or  bQ,  respectively,  to 
the  name.  The  result  is  the  STDS  described  by  Childs. 

We  note  that  for  completeness  we  may  also  wish  to  define  two  order¬ 
ing  relations  r  and  rQ  for  the  collection  N  of  set  names  and  the  collec- 
o  y 

tion  B  of  data  item  names,  respectively.  These  relations  may  be  repre¬ 
sented  within  the  context  oi  the  SSM  simply  by  setting  a  =0  and  using  all 
other  decision  variable  values  as  given  for  the  structure  of  Figure  3-18. 

This  concludes  our  discussion  of  the  Storage  Structure  Model. 

We  have  described  in  this  chapter  a  model  for  the  storage  structures 
capable  of  representing,  via  the  three  basic  organizational  methods  , 
the  intrinsic  structure  of  a  given  collection  of  data  items  as  described 
by  the  Data  Structure  Model  of  Chapter  II.  The  model  consists  of  a 
given  relatively  general  storage  structure  (Figure  3-10)  together 
with  a  number  of  transformations  which  may  be  applied  to  this 
storage  structure,  the  combination  of  which  is  described  by  a  number 
of  binary -valued  decision  variables.  In  addition  we  have  shown  the 
model  to  be  flexible  and  capable  of  representing  the  intrinsic 

structure  of  data  in  varying  degrees  of  explicit  detail. 
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Chapter  IV 


ANALYSIS  OF  THE  STORAGE  STRUCTURE  MODEL 

In  the  previous  chapter  we  developed  a  model  for  the  storage 
structures  capable  of  representing  the  intrinsic  structure  of  some 
collection  of  da.a  items  as  given  by  the  Data  Structure  Model  devel¬ 
oped  in  Chapter  n.  Our  purpose  for  developing  the  Storage  Structure 
Model  was  not  simply  to  illustrate  the  fact  that  the  intrinsic  structure 
of  some  given  collection  of  data  items  can  be  represented  by  a  multitude 
of  storage  structures.  Rather  it  was  our  purpose  to  develop  a  vehicle 
by  means  of  v  lich  we  can  compare  the  relative  morits  of  these 
storage  structures. 

Our  goal  in  this  chapter  will  be  the  development  of  certain 
measures  of  performance  for  the  SSM  which  we  can  use  to  make  these 
comparisons. 
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4. 1  Measures  of  Performance 


It  is  generally  conceded  that  the  amount  of  time  required  to 
perform  certain  operations  upon  a  given  storage  structure  and  the 
amount  of  storage  occupied  by  that  structure  are  the  two  most 
important  factors  to  be  considered  in  determining  the  "goodness” 
of  the  structure.  One  might  argue  that  a  factor  such  as  the  number 
of  man-hours  required  for  implementation  of  the  software  associated 
with  a  given  storage  structure  is  equally  important,  but  if  the  struc¬ 
ture  is  to  see  more  than  limited  use,  this  factor  is  of  transient 
interest  and,  hence,  of  little  importance  by  comparison. 

It  is  also  genarally  (but  not  always)  true  that  one  may  reduce 
time  at  the  expense  of  increasing  storage  and  vice  versa. 

Under  many  computer  operating  systems-  the  batch,  mono- 
programmed,  real-memory  systems,  for  instance -the  cost  of 
solving  a  given  problem  (i.  e. ,  running  a  given  program)  is  based 
soley  upon  the  CPU  time  required  to  achieve  the  solution,  with  the 
only  constraint  placed  upon  the  storage  being  that  the  program  and 
storage  structure  must  fit  into  the  available  storage  (usually,  quite 
substantial) . 

At  the  other  extreme,  with  time-shared,  multi-programmed, 
virtual- memory  systems,  the  cost  of  solving  a  problem  is  often  a 
function  of  the  amount  of  storage  occupied  as  well  as  the  CPU  time, 
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although  for  these  systems  the  amount  of  storage  available  for  a 
program  and  its  storage  structure  is  virtually  unlimited. 

In  the  first  case,  that  storage  structure  which  causes  CPU  time 
to  be  minimized  and  occupies  no  more  thar.  the  storage  available  is 
clearly  the  best.  In  other  words,  we  are  willing  to  use  any  amount 
of  storage,  up  to  the  maximum  amount  available,  as  long  as  the  use 
of  that  storage  allows  CPU  time  to  be  reduced. 

In  the  second  case,  however,  no  such  clear-cut  policy  exists. 
Suppose  that  the  cost  C  of  running  a  program  is  given  by 

C  =k  f  T(1+0.01P)1 

where  T  represents  CPU  time,  P  represents  the  average  number  of 
pages  of  virtual  memory  required  during  the  running  of  the  program, 
and  k  represents  the  dollar  cost  per  unit  of  CPU  time.  (This  is 
the  charging  scheme  currently  in  v.se  for  MTS,  the  Michigan  Terminal 
System,  at  The  University  of  Michigan.  [  ] )  Increasing  by  100 

pages  the  amount  of  storage  used  is  equivalent  to  increasing  CPU 
time  by  T  units. 

The  question  we  would  like  to  ask  is,  How  much  must  CPU  time 
be  reduced  in  order  to  justify  an  increase  of  100  pages  in  the  virtual 
memory  used?  Let  T  represent  CPU  time  for  P  pages  of  memory 
and  let  T-t  represent  CPU  time  for  P+100  pages,  where  t  represents 
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the  net  reduction  in  CPU  time.  Then 


(T-t)[  1+.01(P+100)]  <_T(1+.01P) 

must  be  true  in  order  to  justify  the  increase  in  storage.  It  follows 
that 

T 

t  >  - 1 - 

-  2+.01P 

must  be  true  to  effect  a  net  gain  when  storage  is  increased  by  100 
pages.  Figure  4-1  contains  a  plot  of  t/T,  the  minimum  fractional 
reduction  of  CPU  time  required  to  offset  an  increase  of  100  pages 
of  storage,  versus  P,  the  initial  storage  requirement. 

Clearly,  a  smaller  percentage  reduction  in  CPU  time  is 
necessary  to  justify  an  increase  in  the  storage  used  if  the  storage 
requirement  is  large  to  begin  with. 

We  are  faced,  however,  with  the  problem  of  not  knowing  how 
CPU  time  and  storage  will  vary  as  the  storage  structure  is  perturbed 
Moreover,  we  have  considered  but  a  single  cost  function.  We  must 
therefore,  develop  a  procedure  for  evaluating  the  time  and  storage 
requirements  of  each  storage  structure  represented  by  the  SSM,  while 
at  the  same  time  remaining  flexible  enough  to  consider  a  variety  of 
cost  functions. 

Basically,  the  procedure  which  we  will  use  here  is  to  determine 
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Figure  4-1.  Minimum  Fractional  Reduction  of  CPU  Time  Necessary 
to  Offset  an  Increase  of  100  Pages  in  Program  Size 
for  Cost  Function  C=k[T(l+0.  01P)  J 
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the  storage  structure  that  minimizes  time,  subject  to  a  constraint 
on  storage.  Clearly,  this  guarantees  an  optimal  storage  structure 
for  the  batch,  monoprogram  med,  real-memory  systems. 

For  systems  with  more  complex  c  ^unctions,  if  we  ease 
the  storage  constraint  to  allow  for  virtually  infinite  storage,  we  will 
be  guaranteed  a  storage  structure  which  is  optimal  in  the  sense  that 
it  r.inimizes  time,  although  it  may  not  be  optimal  in  the  sense  that 
it  minimize'!  the  given  cost  function.  Then  using  the  storage  struc¬ 
ture  determined  in  this  manner  as  a  basis  for  comparison  we  may 
constrain  the  available  storage  to  be  less  than  that  required  for  our 
comparison  structure  and  determine  the  storage  structure  which 
minimizes  time  subject  to  this  constraint.  If  the  storage  structure 
thus  determined  results  in  a  lower  value  for  the  given  cost  function, 
we  may  use  it  as  our  comparison  structure  and  repeat  the  procedure. 
If  the  value  of  the  cost  function  exceeds  the  best  so  far,  we  may  ease 
the  storage  constraint  somewhat  and  then  repeat  the  rrocedure. 

If  the  given  cost  function  behaves  as  shown  in  Figure  4-2(a),  this 
procedure  will  allow  us  to  come  as  close  to  the  optimal  solution  as 
we  wish,  but  if  the  given  cost  function  behaves  as  shown  in  Figure 
4-2(b),  we  may  in  fact  determine  a  solution  which  is  only  suboptimal. 
In  each  of  these  figures  C*  represents  the  value  of  the  cost  function 
for  the  storage  structure  obtained  when  storage  is  unconstrained. 


127 


Since  we  have  no  way  of  determining  the  behavior  of  the  cost  function 
we  can  guarantee  only  a  suboptimal  solution  (provided  the  process  con¬ 
verges)  .  Of  course,  if  we  are  willing  to  determine  the  best  solution  for 
every  possible  storage  constraint  between  0  and  the  storage  required 
for  our  initial  solution,  we  can  always  guarantee  an  optimal  solution. 

Our  discussions  for  the  remainder  of  this  chapter  will  assume 
simply  that  we  are  interested  in  determining  that  storage  structure 
which  minimizes  time  subject  to  a  constraint  (possibly  infinite) 
on  storage. 

4.  2  Time  Cost  Function 

We  have  determined  that  in  order  to  compare  the  relative  merits 
of  a  collection  of  storage  structures  we  must  characterize  each 
storage  structure  by  two  quantities:  time  and  storage. 

The  storage  characteristic  of  a  given  storage  structure  is  simply 
the  number  of  storage  units  occupied  by  the  structure  and,  hence,  is 
relatively  well  defined.  The  time  characteristic  of  a  storage  struc¬ 
ture  is  not,  however,  so  well  defined. 

Intuitively,  we  feel  that  the  time  characteristic  should  be  some 
measure  of  the  amount  of  time  required  to  perform  certain  operations 
upon  the  given  storage  structure.  These  operations  may,  of  course, 
vary  with  the  problem  being  solved. 

We  will  define  a  number  of  primitive  operations  which  are 
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representative  of  the  types  of  operations  one  might  wish  to  perform 
upon  a  collection  of  data  items  and  from  which  one  can  construct 
other  more  complex  operations.  Then  in  order  to  define  a  parti¬ 
cular  problem  which  we  wish  to  solve,  we  will  assign  weights  to 
each  of  these  primitive  operations  to  reflect  the  relative  frequency 
with  which  the  operation  is  used  in  the  solution  of  the  problem. 

Let  where  ie  {1 , 2,  •  •  • ,  N}  represent  some  primitive  operation 
and  let  a.^  represent  the  relative  frequency  of  that  operation.  As  a 
matter  of  convenience  let  us  place  the  following  constraints  upon  the 


various  a^: 


0<a.  <1.0  fox  ie  {1, 2,  •  •  •  ,N} 
N 


Z 

i=l 


a .  -  1 . 0 


Then  if  we  can  determine  the  number  of  time  units  t^.  required  to 
perform  operation  Q.,  where  ie  {1, 2,  •  •  •  ,N},  using  a  particular 
storage  structure,  we  will  define  the  time  cost  T  for  that  storage 
structure  as 

N 

T  ■  £,  ^ 

Assuming  no  constraint  on  storage,  th  at  storage  structure  for 
which  T  is  the  smallest  will  be  considered  the  optimal  structure  to 
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use  in  the  solution  of  the  given  problem  (as  defined  by  the  various 

ai). 

Thus,  we  are  faced  with  two  problem  s: 

(1)  Definition  of  the  primitive  operations  for 
ie{M, -,N}. 

(2)  Determination  of  the  time  cost  tj  for  each 
primitive  operation  Q^. 

In  defining  the  primitive  operations  we  may  separate  them  into 
two  classes:  interrogative  operations  and  manipulative  operations. 
Interrogative  operations  query  a  storage  structure  to  determine 
certain  information  about  the  data  represented,  such  as  the  targets 
associated  with  a  given  source/relation  symbol  pair.  On  the  other 
hand,  manipulative  operations  result  in  alterations  to  the  structure 
to  effect  changes  in  the  data  represented,  such  as  the  addition  of 
a  relation  instance  to  the  structure. 

Although  the  manipulative  operations  are  no  less  important, 
we  will  confine  ourselves  to  considering  only  the  interrogative  opera¬ 
tions.  The  reason  for  this  restriction  is  simply  to  narrow  the  scope 
of  our  problem.  The  techniques  which  we  develop  here  will  be 
e\  Tally  applicable  to  both  types  of  operations.  Thus,  given  sufficient 
time  we  could  extend  our  consideration  to  the  manipulative  operations 
as  well. 
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4.2.1  Primitive  Operations 


Since  our  model  for  the  intrinsic  structure  of  data  is  based  upon 
the  relation  instance,  it  would  seem  logical  to  base  our  primitive 
operations  upon  the  relation  instance,  also. 

The  form  of  the  relation  instance  is  d  r  p  where  deA,  reP,  and 
pc  IT.  We  may  define  our  primitive  operations  by  assigning  to  the 
three  fields  of  the  relation  instance  form  fixed  values,  variables, 
and  ’’don’t  cares”  in  various  combinations. 

The  operation  defined  by  assigning  some  particular  value  to 
each  field  of  the  relation  instance  form  simply  determines  whether 
or  not  the  resultant  relation  instance  is  true.  For  instance,  if  we 
assign  the  values  d.e  A,  r .e  P,  and  p^c  n  to  the  three  fields  of  the 
form  (in  that  order),  the  resultant  triple  d.  r.  p.  may  or  may  not 

1  ]  K 

constitute  a  valid  (or  true)  relation  instance.  This  operation  com¬ 
pares  the  triple  di  r^  Pj^  with  each  known  relation  instance.  If  a 
match  is  found,  the  value  of  the  operation  is  true;  otherwise  the 
value  is  false. 

For  ease  of  reference  we  will  characterize  this  operation  by 

the  triple  d.  r.  p,  ,  which  we  will  call  its  prototype. 

1  ]  k  - 

Suppose  we  assign  the  values  d^c  A  and  r^e  Pto  the  first  two 
fields  of  the  form  and  a  variable  to  the  last  field,  which  results  in 
the  prototype  d.  r.  -  where  the  ”  -  ”  represents  a  variable.  The 


132 


operation  defined  in  this  manner  determines  all  targets  p  which 


satisfy  d^  r.  p.  We  assume  that  the  source  d^  is  associated  with 
the  relation  symbol  r..  Therefore,  this  operation  will  find  at  least 
one  target  satisfying  d.  r.  p.  We  can  define  two  other  operations 

J 

with  prototypes  d.  -  p^  and  -  r.  ^  respectively,  which  perform 
similar  functions. 


Instead  of  assigning  a  variable  to  the  third  field  of  the  form  as 


above,  we  might  assign  a  ’’don’t  care",  which  results  in  the  proto¬ 
type  di  Tj  *  ,where  the  represents  the  "don’t  care".  This 
operation  determines  whether  or  not  the  source  d^^  is  associated  with 
the  relation  symbol  rj  and  has  a  value  true  or  false,  accordingly. 
Again  we  can  define  two  similar  operations  with  prototypes  d.*  p^ 
and  *  r.  pk,  respectively. 

If  we  assigr  a  particular  value  to  but  a  single  field  of  the  relation 
instance  form,  we  have  several  other  possibilities  available  to  us. 
The  prototype  -  r^  -  defines  an  operation  which  determines  all 
source/target  pairs  (d,p)  such  that  d  r .  p  is  true.  The  prototypes 

J 

d.  -  -  and  -  -  ^  define  similar  operations.  The  prototype  -  r.*  defines 
an  operation  which  determines  all  sources  d  associated  with  the 


relation  symbol  r„  Similar  to  this  operation  aie  those  with  the 
following  prototypes:  *  r.  -,  d.  -  cl  *  and  *  -  p^. 


All  of  these  operations  are  summarized  in  Table  4-1. 
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Operation 


Code 

Prototype 

Description 

di  ripk 

IS  d.  r . 

true? 

q„ 

dirj* 

Is  d.  associated  with  r.? 

2 

i 

J 

«3 

Vr 

Determine  all  p  such  that  d..  r^  p. 

O 

*  '-jPk 

Is  p,K  associated  with  r^  ? 

«5 

-  ri^ 

Determine  all  d  such  that  d  r^  p^. 

di  *Pk 

Is  p^  associated  with  di  ? 

Q7 

d.  -  p. 
l 

Determine 

all  r  such  that  d.  r  p^. 

% 

-  r.  * 

1 

Determine 

all  d  associated  with  r^. 

% 

*  r.  - 
1 

Determine 

all  p  associated  with  r.. 

] 

Q10 

-  r.  - 
J 

Determine 

all  (d,p)  such  that  d  r^  p. 

«11 

d.  -  * 

l 

Determine 

all  r  associated  with  d^ 

^12 

v- 

Determine 

all  p  associated  with  d^. 

«13 

d.  -  - 

l 

Determine 

all  (r,p)  such  that  d.  r  p. 

Q14 

-  *Pk 

Determine 

all  d  associated  with  p^. 

\5 

*  -pk 

Determine 

all  r  associated  with  p^. 

^16 

-  -  Pk 

Determine 

all  (d,  r)  such  that  d  r  p^. 

*  indicates 

"don’t  care" 

-  indicates  variable  field 
Table  4-1.  Primitive  Operations 
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We  could  have  defined  an  operation  with  the  prototype  — ,  that 
is,  one  with  variables  in  all  positions  of  the  relation  instance  form, 
but  such  an  operation  would  provide  all  the  relation  instances  of  the 
structure  and  is, therefore, not  very  enlightening.  Similarly,  we 
could  have  defined  operations  with  the  prototypes  d.**,  *r.+  ,  and 
**Pk,  respectively,  but  since  we  assume  that  d.,  r^,  and  p^  must 
appear  somewhere  in  the  structure,  these  operations  provide  no 
information  at  all. 

Clearly,  we  still  have  not  exhausted  all  the  possibilities  for 
operations  which  may  be  defined  using  the  form  d  r  p.  We  have 
certainly  considered  all  possible  operations  for  which  the  fixed 
quantities  are  constrainted  to  be  single-valued,  but  we  could  define 
similar  operations  for  which  the  fixed  quanities  are  allowed  to  be 
mutli-valued.  For  instance,  we  could  define  an  operation  which 
determines  all  targets  shared  by  some  set  of  source/relation  symbol 
pairs.  We  choose  not  to  do  this,  however,  for  the  same  reason 
we  are  not  considering  manipulative  operations.  Nonetheless,  the 
sixteen  primitive  operations  which  we  have  defined  should  be  repre¬ 
sentative  of  most  (interrogative)  operations  we  might  want  to 
consider. 

We  are  now  faced  with  the  problem  of  determining  the  time  cost 
t.  fo’-  each  primitive  operation  Q.  where  ie{l,  2,  •  •  •  ,16}. 
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But  in  order  to  determine  t^  we  need  some  algorithm  or  pro¬ 
cedure  which  describes  the  implementation  of  operation  Q^.  Clearly, 
there  does  not  exist  any  such  unique  procedure. 


Consider  for  example  operation  Q^: 


d.  r. 
x  l 


ipk- 


For  this  operation 


we  may  define  four  basic  procedures: 


(1)  Find  ci  in  the  storage  structure.  Examine  all  relation 

symbols  associated  with  d.  in  search  of  r..  If  r.  is 

J  1 

found,  examine  all  targets  in  the  corresponding  target 


set  in  search  of  p^. 

(2)  Find  r.  (the  relation,  not  just  a  relation  symbol)  in  the 

1 

storage  structure.  Examine  all  sources  associated 


with  r.  in  search  of  d..  If  d.  is  found,  examine  all 
J  xi’ 

targets  in  the  corresponding  target  set  in  search  of  p 


(3)  Find  r.  in  the  storage  structure.  Examine  all  targets 
j 

associated  with  r.  in  starch  of  p^.  If  is  founa, 
examine  all  sources  associated  with  the  appropriate 


relation  symbol/target  set  pair  in  search  of  d. . 

(4)  Find  in  the  storage  structure.  Examine  all  relation 

symbols  associated  with  in  search  of  r..  If  r.  is 

1  J 

found,  examine  all  sources  associated  with  the  appro¬ 
priate  relation  symbol-target  set  pair  in  search  of  a.. 


Even  each  of  these  basic  procedures  is  subject  to  some  variation. 

For  instance,  the  process  of  searching  for  r^  given  di  can  be  imple¬ 
mented  in  a  number  of  different  ways  (although  the  differences  may 
be  slight).  However,  for  our  purposes  we  will  assume  that  each  of 
the  basic  procedures  has  some  unique  implementation.  We  will 
choose  one  which  we  feel  is  representative  of  all  the  implementations 
of  the  given  basic  procedure  and  let  that  implementation  be  the 
"unique”  one. 

Each  of  our  primitive  operations  will  then  be  characterized  by  a 
number  of  "unique"  basic  procedures  which  we  will  call  methods  . 
Operation  Q^d 

instance.  Descriptions  of  the  methods  for  each  of  the  primitive  oper¬ 
ations  are  contained  in  Appendix  A. 

Let  us  introduce  a  simple  notational  scheme  for  describing  the 
methods  of  an  operation.  Let  us  denote  the  steps  in  a  method  by  a 
sequence  of  symbols,  one  for  each  step,  from  left  to  right.  Let  the 
steps  "Search  for  the  source  d.",  "Search  for  the  relation  (or  relation 
symbol)  r/',  and  "Search  for  the  target  p^"  be  denoted  by  the  symbols 

AAA 

d,  r,  and  p,  respectively.  Let  the  steps  "Determine  all  sources", 
"Determine  all  relations  (or  relation  symbols)",  and  "Determine  all 
targets"  be  denoted  by  the  symbols  d,  r,  and  p,  respectively.  The 

A  A 

sequence  d  r  p  would  then  be  interpreted  as:  (1)  Search  for  the 


r.  p^  is  characterized  by  four  methods,  for 

J 
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source  d. ,  (2)  Determine  all  relation  symbols  (associated  with  the 
source  d^) ,  and  (3)  Search  for  the  target  (among  those  targets 
associated  with  the  source  d^  and  the  relation  symbol  currently 
under  scrutiny).  Using  this  notation  we  have  summarized  the 
methods  for  each  of  the  primitive  operations  in  Table  4-2. 

In  general,  each  of  the  methods  of  a  given  primitive  operation 
will  have  a  different  time  cost.  Let  the  time  cost  for  method  j  of 
operation  be  represented  by  t„.  We  will  then  define  the  time  cost 
t.  of  operation  Q.  by 

t  _  min  fc  ie[l,2, . . . ,  16} 

1  J  ij  je{l,2,...,SiJ 

where  Sj  represents  the  number  of  methods  for  operation 

Thus,  to  determine  the  time  cost  t  for  operation  Q^,  we  must 
determine  the  time  costt^  for  each  method  of  the  operation  and  then 
choose  that  method  which  has  the  smallest  time  cost  to  represent 
the  operation 

4.2.2  Elementary  Time  Costs 

In  performing  a  given  primitive  operation  (via  any  of  its  methods) 

upon  a  storage  structure  we  will  find  it  necessary  to  trace  our  way 

from  one  point  in  the  structure  to  another.  In  general,  this  will  give 

involve  following  pointers  in  rings  and  sequencing  through  stacks. 

In  order  to  determine  the  time  cost  of  the  operation  we  may, 
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Operation 

Method  1 

Method  2 

Method  3 

AAA 

AAA 

AAA 

Q1:  VjPk 

d  r  p 

r  d  p 

r  p  d 

\  A 

A  A 

Q2:  dirj* 

d  r 

r  d 

A  A 

A  A 

A  A 

Q3:  d^  - 

d  r  p 

r  d  p 

p  r  d 

A  A 

A  A 

Q4:  *rjPk 

r  p 

P  r 

A  A 

A  A 

A  A 

Q5:  -rjPk 

r  p  d 

p  r  d 

d  r  p 

A  A 

A  A 

Q6:  VPk 

d  p 

P  d 

A  A 

A  A 

A  A 

Q7:  di_l\ 

d  r  p 

p  r  d 

r  d  p 

A 

A 

Q8:  '  V 

r  d 

d  r 

\ 

A 

S:  *  rj  - 

r  p 

P  r 

A 

A 

A 

Qio:  -rj‘ 

r  d  p 

r  p  d 

d  r  p 

A 

A 

Qu:  dj  -  • 

d  r 

r  d 

A 

A 

«12:  di*  - 

d  p 

P  d 

A 

A 

«13:  di" 

d  r  p 

r  d  p 

A 

A 

Q14:  -*»k 

P  d 

d  P 

A 

A 

Q15:  *'Pk 

P  r 

r  p 

A 

A 

Q16:  "Pk 

p  r  d 

r  p  d 

Table  4-2.  Summary  of  Methods  of  Implementation 
for  Primitive  Operations 


Method  4 


AAA 

p  r  d 


A  A 

r  p  d 


P 
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therefore,  wish  to  know  the  amount  of  time  required  to  access  one 
block  in  the  structure  from  another.  As  a  result,  we  will  intro¬ 
duce  a  number  of  elementary  time  costs  ,  each  of  which  represents 
the  amount  of  time  required  to  access  one  particular  type  of  block 
in  the  storage  structure  from  another. 

Before  beginning  consideration  of  these  elementary  time  costs, 

let  us  clarify  somewhat  the  concept  of  tracing  through  a  storage 
structure.  We  assume  that  there  exists  a  position  indicator,  or 

simply  an  indicator,  which  contains  the  address  of  that  field  (in 
some  block)  currently  of  interest  or  under  consideration.  The  indi¬ 
cator  is  clearly  just  a  pointer  to  the  field  of  interest.  As  our  inter¬ 
est  shifts  from  point  to  point  within  the  structure,  the  value  of  the 
indicator  changes  to  reflect  this.  More  correctly,  the  value  of  the 
indicator  changes  to  reflect  the  effects  of  operations  applied  to  the 
structure  to  elicit  information  from  it,  and  our  interest  shifts 
accordingly.  Tracing  through  a  storage  structure  simply  amounts 
to  stepping  the  indicator  along  certain  paths  of  access  within  the 
structure  as  required  by  operations  applied  to  the  structure. 

Let  I  represent  the  current  value  of  the  indicator.  I  is  then 
equal  to  the  address  of  the  field  of  current  interest.  Let  (I)  represent 
the  contents,  or  value,  of  the  field  whose  address  is  I. 

We  will  now  define  a  number  of  very  basic  quantities  to  be  used 
in  the  formulation  of  the  elementary  time  cost  expressions.  Let  f 
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represent  the  number  of  time  units  required  to  follow  a  forward 
pointer  from  one  block  to  another  in  an  airing  erf  the  SSM,  where 
ie  {1 , 2,  •  •  • ,  10} .  f .  is  then  the  number  of  time  units  required  to 
replace  I  by  (I),  where  I  is  the  address  of  a  forward  pointer  field 
in  some  block  of  an  airing,  f.  is  of  interest,  of  course,  only  if 

v1- 

Let  sA  represent  the  number  of  time  units  required  to  step  or 
sequence  from  one  block  to  another  through  a  stack  of  blocks  in  an 
airing  of  the  SSM,  where  i*  {1 , 2,  •  •  • ,  10}.  is  then  the  number 
of  b.me  units  required  to  add  (or  subtract)  some  displacement  D  to 
I,  where  1  is  the  address  of  an  arbitrary  field  in  some  block  of  an 
airing.  Clearly,  si  is  of  interest  only  if  0.  =  0  and  Ai  =  0. 

Let  hj  represent  the  number  of  time  units  required  to  follow  a 
head  pointer  from  an  element  block  to  the  head  of  an  airing  of  the 
SSM,  where  ie{l,  2,  •  •  •  ,10}.  In  is  then  the  number  of  time  units 
required  to  replace  I  by  (I),  where  I  is  the  address  of  a  head  pointer 
field  in  some  element  block  of  an  a^-ring.  Of  course,  h^  is  of 
interest  only  if  0?  =  1. 

Finally,  let  sQ  represent  the  number  of  time  units  required  to 
step  from  one  pointer  field  of  a  given  block  to  another  pointer  field  of 
that  block.  For  example,  in  tracing  through  a  structure  we  might 
enter  a  particular  block  via  one  ring  and  desire  to  continue  our 
tracing  with  the  second  ring  which  passes  through  the  block.  In 
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order  to  do  this,  we  must  add  (or  subtract)  some  displacement  D  to 
I,  where  I  is  the  address  of  the  pointer  field  in  the  first  ring  and 
I  ±  Dis  the  address  of  the  pointer  field  in  the  second  ring.  sq  reflects 
the  time  required  to  carry  out  this  addition  (or  subtraction).  Note 

that  the  second  pointer  field  may  contain  a  head  pointer  instead  of  a 
ring  (forward)  pointer,  but  the  process  to  be  carried  out  is  essen¬ 
tially  the  san.°. 

To  be  completely  general  we  should  also  define  some  quantity 
to  represent  the  number  of  time  units  required  to  follow  a  reverse 
pointer  in  each  of  the  rings  of  the  SSM.  However,  reverse  pointers 
are  generally  useful  only  for  the  manipulative  operations  which  we 
might  define  and  since  we  are  not  considering  manipulative  operations 
(and  since  reverse  pointers  will  not  be  useful  for  our  interrogative 
operations),  we  shall  assume  that  the  structures  we  consider  contain 
no  reverse  pointers  (i.e. ,  07  =0  for  all  ie{l,  2,  •  •  •  ,10}  ),  Hence, 
we  have  no  need  for  the  quantity  mentioned. 

We  might  question  the  usefulness  of  defining  a  separate  quantity 
f^  (or  si  or  la)  for  each  ring  of  the'  SSM.  There  are  two  reasons  for 
doing  so.  The  first  is  for  the  sake  of  generality,  and  the  second 
is  for  convenience  when  it  comes  time  to  develop  expressions  for 
the  elementary  time  costs  (we  can  tell  at  a  glance  which  a^-rings  of 
the.  SSM  are  under  consideration). 

Realistically,  it  would  be  quite  unusual  if  s^,  s^,  s^,  •  •  • ,  s^ 

were  not  al1  equal.  The  same  may  in  general  be  said  of  f.  and  h. . 
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In  fact,  fj.  and  h^  are  probably  equal  to  one  another  for  all  values  of 
i,  also.  It  is  possible,  however,  for  pointers  to  be  stored  in 

different  size  fields  or  for  them  to  be  located  in  different  relative 
portions  of  a  storage  unit  or  for  some  to  require  shifting  before  use 
and  so  forth.  All  of  these  factors  can  contribute  to  differences  in 

the  times  arising  from  their  use.  Furthermore,  instead  of  actual 
pointers  we  may  want  random  pointers  in  some  rings.  As  we  indi¬ 
cated  earlier,  the  uses  of  these  two  types  of  pointers  will  in  general 
be  characterized  by  different  amounts  of  time. 

We  will  make  two  assumptions  regarding  the  various  quantities 
f.,  si,  and  h.  First  we  will  assume  that  the  time  required  to  follow 
a  pointer  is  always  at  least  as  great  as  the  time  to  step  through  a 
stack.  Thus,  <  f^  and  s^  <  h^  for  ie  {1, 2,  •  •  • ,  10}.  Since  following 
a  pointer  will  always  involve  a  storage  reference  whereas  stepping 
through  a  stack  need  not,  this  is  a  reasonable  assumption  to  make. 
Second,  we  will  assume  that  the  time  required  to  follow  a  head 
pointer  will  never  be  less  than  the  time  to  follow  a  forward  pointer. 
Hence,  f^<  In  for  ie{l,2,  •  •  •  ,10}.  This  implies  that  if  either  the 
forward  pointer  or  the  head  pointer  must  be  subject  to  additional 
processing  (because  of  different  size  fields,  etc.)  the  head  pointer 
will  be  chosen  for  this  additional  processing.  Since  forward  pointers 
will  in  general  be  used  more  frequently  than  head  pointers  (when 
both  appear  in  the  same  ring),  this  is  a  reasonable  assumption  to 
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make.  Combining  these  two  assumptions,  we  can  write 
si  <  L  <  h.  for  ie{l,2,**',10}. 

(We  note  that  this  assumption  is  not  crucial  to  the  development  of 
the  time  cost  function,  but  reflects  simply  a  condition  which 
generally  exists.) 

Let  us  return  now  to  consideration  of  the  elementary  time  costs 
in  which  we  have  expressed  some  interest  but  which  we  hav°  not  yet 
defined.  These  elementary  time  costs  will  be  divided  into  three 
classes  as  follows:  (1)  those  which  reflect  the  number  of  time  units 
required  to  move  (i.  e. ,  trace  our  way)  from  one  element  block  of  a 
ring  of  the  SSM  to  another  element  block  of  that  ring,  (2)  those  which 
reflect  the  number  of  time  units  required  to  move  from  an  (arbitrary) 
element  block  of  a  ring  to  the  head  of  that  ring,  and  (3)  those  which 
reflect  the  number  of  time  units  required  to  move  from  the  head  of  a 
ring  to  the  first  elemen;  block  of  that  ring  or  vice  versa. 

Let  e^  represent  the  number  of  time  units  required  to  move  from 
one  element  block  of  an  a. -ring  of  the  SSM  to  another  el  .merit  block 
of  that  ring,  where  ie{l,  2,  *  •  *  ,10}. 

Consider  for  example  e^,  the  number  of  time  units  required  to 
move  from  one  b^ -block  in  an  a^-ring  of  the  SSM  to  another  -block 
in  that  ring. 
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If  0g  =  1,  the  airings  are  indeed  explicit  rings  and  we  need  only 
follow  a  forward  pointer  to  move  from  one  bg -block  to  another. 

Hence,  in  this  case  eA  =  f„. 

On  the  other  hand  if  02  =0,  there  exist  a  number  of  possibilities . 
If  a2=1,  each  airing  (which  is  not  really  a  ring,  of  course)  may  be 
viewed  as  containing  a  single  b^ -block.  We  will  assume,  therefore, 

that  e2=  0. 

If  a2=  0,  we  are  again  faced  with  a  number  of  alternatives. 
Suppose  0g=  1.  Then  the  b^-blocks  of  a  given  airing  will  be  stacked 
upon  the  corresponding  b2-block,  and  e2=  Sg. 

A  more  complex  situation  could  arise  if  instead  03=  0,  04  =  0, 

1  j  Ag=  1 ,  and  a4=  0,  which  results  in  the  situation  depicted  in 
Figure  4-3  (where  type  fields  have  been  included  in  each  block  for 
clarity).  In  this  case,  the  b^-blocks  of  each  airing  are  stacked 
upon  the  corresponding  b4~block,  which  is  duplicated  upon  each  b3~ 
block  of  the  associated  a^-ring.  The  b^ -blocks  of  each  a2-ring  are 
in  turn  stacked  upon  the  corresponding  bg-block. 

Since  it  is  possible  for  the  b3 -blocks  or  the  b5 -blocks  or  both 
to  contain  head  pointers  to  bg-blocks  and  b4~blocks,  respectively, 
onu  since  these  head  pointers  may  be  used  to  advantage  if  we  wish  to 
move_up_  the  structure  for  Figure  4-3,  it  now  becomes  important  to 
know  whether  we  are  moving  from  some  bg -block  to  the  bg -block 
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above  it  or  to  the  bg -block  below  it.  This  will  be  determined  by  the 
general  direction  in  which  we  are  moving  through  the  storage  structure 
as  a  whole.  For  instance,  if  we  are  tracing  through  the  structure 
from  a  b^ -block  representing  some  source  in  search  of  some  bg- 
block  representing  a  relation  symbol,  we  are  moving  downward  from 
a  given  bg -block  to  the  one  below  it,  and  vice  versa. 

Let  6  i  be  a  binary -valued  variable,  the  value  of  which  indicates 
whether  movement  is  toward  (Cpl)  or  away  from  (SpO)  the  head 
of  an  airing  when  the  element  blocks  of  that  ring  are  stacked  upon 
the  head,  where  ie{l,  2,  •  *  • ,  10}. 

Returning  to  consideration  of  e 2,  if  5  2=  1 »  we  can  take  advantage 

of  the  head  pointers  (if  present)  in  the  stack  of  bg-blocks  between 

consecutive  bg-blocks.  Thus,  if  62=  1  311(1  <t>\  =  1,  we  can  step  from 

a  b„ -block  to  the  first  bc  -block  on  the  bottom  of  the  stack  above  it 
o  o 

and  then  follow  the  head  pointer  in  the  b^-block  to  the  b^-biock  which 
acts  as  the  head  of  the  stack.  Since  there  is  a  bg -block  associated 
with  every  b^-block  in  the  structure,  we  may  treat  the  two  as 
indistinguishable  (i.e. ,  as  a  single  block).  Following  the  head  pointer 
then  brings  us  to  the  bg -block  desired.  It  follows  that  e2  is  given  by 

S2  +  ^4’ 

If  on  the  other  hand  6  2=  0  or  0^=  0,  we  must  step  from  the  bg- 
bloL-.  to  the  bg-block  adjacent  to  it  and  then  step  through  the  stack 
of  bg-blocks  of  which  there  are  k^.  In  this  case  s2  +  k^s^. 
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Thus,  if  0g=  0,  0g=  0,  04=  0,  0g=  1,  A 2=  0,  A^=  1,  and  A^-  0, 
then  e2  will  be  given  by  the  following  expression: 

e2=  s2  +  04  6  2  h4  +  ^4  +  k4S4 
where  the  "+"  between  the  two  decision  variables  0^  and  62  is 
treated  essentially  as  disjunction.  That  is,  if  either  0^  or  62  or  both 
are  1,  then  the  value  of  0^  +  62  is  1;  otherwise  its  value  is  0.  We 
may  also  treat  the  product  0^6  2  as  conjunction,  although  both  the 
arithmetic  product  and  the  logical  product  yield  the  same  result. 

To  be  consistent.,  we  will  treat  products  and  sums  of  deciscion 
variables  (which  are  represented  by  Greek  letters)  as  conjunction  and 
disjunction,  respectively.  The  result  of  these  operationo  .vill  be 
either  a  1  or  a  0  which  will  then  be  treated  simply  as  an  integer  in 
any  arithmetic  operation  (specifically,  an  arithmetic  product). 

For  example  ,  in  the  above  expression  for  e2  the  term  0^  5  2h^ 
is  effectively  the  arithmetic  product  of  h  ^  and  a  1  or  a  0,  depending 
upon  the  result  of  the  logical  product  0^  6  2> 

Returning  once  again  to  our  consideration  of  e2,  suppose 

00=03  =04=05  =06=°.  07=1>  A2=A4=A6=0’  ^  a3=a5=1-  The  resultant 

storage  structure  will  be  similar  to  that  of  Figure  4-3  but  will  have 

stacks  of  b„-blocks  inserted  between  the  various  bc -blocks.  By 
7  b 

applying  reasoning  like  that  used  above,  we  generate  the  following 
expression  for  e2: 
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e2  =  s2  +  2  (h4+hg) 

+  040e62(h4  +keS6) 

+  (0406  +  6  2)  k4  (s4  +  k6S6) 

Implicit  in  this  expression  is  the  assumption  that  if  both  the  b<.-blocks 
and  the  b^-blocks  contain  head  pointers,  then  a  head  pointer  in  a  b^- 
block  points  to  the  head  pointer  in  the  b^-block  associated  with  the 
bg-block  which  is  the  (actual)  head  of  the  ag-ring  of  which  the  b^- 
block  is  a  member.  We  can  make  this  assumption  since  there  is 
a  one-to-one  correspondence  between  the  bg-blocks  and  the  bg-blocks 
and  we  treat  the  two  types  of  blocks  essentially  as  one.  If  we  did  not 
make  this  assumption,  we  would  have  to  step  the  indicator  from  the 
location  pointed  to  by  the  b^-block  head  pointer  to  that  location  con¬ 
taining  the  bg-block  head  pointer,  resulting  in  an  additional  time 

cost  s  to  be  added  to  h„. 
o  6 

We  can  now  continue  in  this  manner  until  we  have  considered 
all  combinations  of  the  deciscions  variables  which  affect  e0. 

Since  expressions  of  the  sort  encountered  above  occur  very 
frequently  in  deriving  expressions  for  the  various  e^,  we  will  define 
a  number  of  general  forms  as  follows: 


SjOiij) 


s.+  0.T  a  h 

i  \  i 


C01 


6.v  k. 

3) 
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SnO^.U  =  S  +  0J  0!  6  (b  +  h.  ) 

1  x  *  3  11_12  3  h  l2 

+  0’  0!  6.  (h  +k.  s.  ) 

1  2  3  1  l2  l2 

+  0!  0!  6  k  (s  +h  ) 

1  2  3  X1  1  *2 

+  (0!  0!  +  6.)  k  (s  -k  s  ) 
1  2  3  1  1  2  2 


S3(i>ll’i2'i3)  =  V  VlVlVVV 

+  W4*J  VVSV 


+  0!  01  0J  5  [h  +k  (s  +h  )] 

*1  l2  3  3  1  n  0  0 


1  2  *2  3 


+  0J  0\  0\  6.  [h  +k  (s  +k  s 

1  2  3  3  1  x2  *2  3  *3 

+  0!  0!  0:  6.  k  (s  +h  +h  ) 

1  2  l3  3  1  1  2  *3 

+  0!  0!  0!  6  ,  k.  (s.  +  h.  +k.  s.  ) 

*1  *2  *3  *1  *1  l2  *3  *3 

+  0!  0!  0!  6.k  [s  +k  (s  +h  )] 

1  2  X3  3  ll  *1  X2  l2  3 

+  (0!  0!  0!  +6  •)  k  [  s  +k  (s  +k  s  )] 

1  2  3  3  1  1  2  2  3  3 
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S  iA)  =  s.+  0!  0!  0!  0’  5  (h  *h  +h  -h  ) 

*  -  '  J  *  3  1  l2  3  l4  3  1  2  3  4 


+  0!  0!  0!  0’  8 .  (h.  +h.  +h.  4-k.  s.  ) 
ll  l2  *3  x4  3  ll  x2  *3  >4  *4 

+  01  0!  0!  0!  6 .  f  h.  +h.  4-k.  (s.  +h.  )] 
h  x2  *3  x4  3  ll  l2  *3  *3  l4 

+  0!  0!  0!  0!  5  .  [  h.  +h.  +k.  (s.  +k.  s.  )1 

h  x2  V4J1  ll  x2  *3  X3  *4  V 

+  0?  0!  0!  0!  6  .  f  h.  4-k.  (s.  +h.  +h.  )] 

1  x2  l3  *4  3  1  x2  X2  *3  *4 

*  \  \%x)] 

A\\VA^VVV\,1} 

+  0!  0:  0!  0!  6  .  {h  4-k.  [  s  4-k  (s  +k  s  )]  } 

1  x2  3  4  J  X1  X2  2  3  3  4  *4 

+  0j  0|  0.'  0J  6  .  k  (s  +h  +h.  +h.  ) 

1  x2  3  4  3  Xl  ll  X2  3  4 

+  0!  0!  0!  0)  6  .  k.  (s.  +h.  +h.  4-k.  s.  ) 

*1  x2  *3  4  3  V  *1  X2  *3  *4  V 

+  0!  0.'  0!  0!  5  •  k.  f  s.  +h.  +k.  (s.  +h.  )] 

X1  X2  *3  *4  3  V  ll  l2  l3  *3  V 

+  0!  0!  0’.  0?  5  .  k.  f  s.  +h.  +k.  (s.  +k.  s.  )1 
‘1  *2  ‘3  '4  J  >1  ‘1  l2  1 3  *3  ‘4  >4 

+  0!  0!  0!  0!  6  .  k.  f  s.  4-k.  (s.  +h.  +h.  )1 

X1  X2  *3  Y  3  ll  xl  l2  x2  l3  XA 

+  0!  0!  0:  0!  5 .  \  [  s  4-k.  (s  +h.  +k  s  )] 

1  2  3  4  3  ll  1  2  2  3  4  Y 

+  0!  0!  0’  0:  6  .  k  {s  4-k.  [  s  +k  (s  +h  )]  } 

1  2  3  4  3  xl  ll  2  2  3  x3  4 

+  (0!  0!  0!  0'  4-6  )  k  {s  4-k  [  s.  +k.  (s  4-k  s  )]  } 
1  2  3  4  3  1  1  2  2  2  3  *4  4 
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We  now  have  a  convenient  shorthand  notation.  Instead  of  having 
to  write 

e2=  S2+045  2h4+  ^4+32}  k4S4 
we  can  write  simply 

e2=  Sx(2,4) 

Similarly,  instead  of 

VS2+04  *662  (h4+h6> 

+  Ve,  6  2  k4(s4+lV 
+  G>\  0’6+  5  2)  k4(s4+k6s6) 

we  can  write 

e2=  S2(2,4,6) 

This  notation  has  been  employed  to  compile  a  complete  tabulation 
of  ei  for  all  ie  {1, 2,  •  •  • ,  10},  which  appears  in  Appendix  B. 

We  have  implicitly  assumed  in  the  derivation  of  these  expressions 
for  e^^  that  either  0^-  1  or  0^=  1  (or  both)  when  both  a^=  0  and  0. 
This  guarantees,  of  course,  that  Constraints  6  and  7  will  be  satisfied. 

There  are  two  reasons  for  making  this  assumption.  First,  this 
assumption  allows  us  to  use  the  same  expressions  for  the  various 
when  0  and  0^=0  as  when  either  1  or  1  or  both.  (The 
b^ -blocks  and  the  b^ -blocks  then  act  as  "stops"  in  the  structure.) 
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Second,  we  need  not  concern  ourselves  with  developing  expres¬ 


sions  for  the  e^  for  the  case  in  which  A  and  II  are  disjoint  or  the 
special  cases  in  which  A  and  H  are  not  disjoint.  Suppose  for  instance 
that  0  for  all  ie  {1, 2,  ”  ‘ ,  10},  A  and  n  are  not  disjoint,  and 
al=  a2~  ®  ^e  intrinsic  structure  of  our  data  is  such  that  d^r^dg, 

d2r2d3’  d3r3d4’ "  ’  diridi+l»  di+l  ri+l  di+2’  *  dn-lrn-ldn 

(andd.,  r.  d.  and  d.  r.  d.  are  not  both  true) ,  then  clearly  our 
\  *2  12  ^2*1 

expressions  for  the  various  e^  must  be  a  function  of  n,  which  item 
of  information  is  not  at  our  disposal. 

Basically  then,  our  reason  for  making  this  assumption  is  to 
avoid  deriving  expressions  for  the  e^  to  cover  each  of  several  possible 
special  cases. 

In  order  to  specify  the  value  of  6  i  to  be  assumed  for  a  particular 
use  of  e^,  we  will  follow  the  convention  that  a  prime  is  affixed  tc  e^ 
whenever  6^=0.  Thus, 


ei  implies  6  A=  1  for  ie  {1, 2,  •  •  • ,  10} 
e!  implies  5  p  0  for  ie  {1, 2,  *  •  *  ,10}  . 


We  will  continue  to  use  e.^  in  the  generic  sense,  also,  but  context 
should  make  the  usage  clear.  As  a  rule  of  thumb,  the  value  of  6^  is 
of  significance  only  when  e^  appears  in  a  time  expression.  Unless 
otherwise  specified,  all  other  uses  of  e.  are  in  the  generic  sense. 

The  quantity  e.  reflects  the  number  of  time  units  required  to 
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move  from  one  element  block  in  an  airing  to  another  element  block 
in  that  ring,  but  in  general  it  does  not  reflect  the  number  of  time 
units  required  to  move  from  the  element  block  nearest  the  head  to 
the  head  or  from  the  head  to  that  element  block. 

Let  e9  represent  the  number  of  time  units  required  to  move  from 
the  element  block  nearest  the  head  in  an  airing  of  the  SSM  to  the 
head  of  that  ring  or  from  the  head  to  that  element  block,  where 
ie  {1, 2,  •  •  •  ,10}.  Note  that  if  0.=  1  (i.e. ,  the  ring  contains  forward 
pointers),  the  element  block  nearest  the  head  when  moving  toward 
the  head  will  in  general  be  different  from  the  element  block  nearest 
the  head  when  moving  away.  This  will  not  affect  the  validity  of  the 
expression  we  will  derive,  however. 

Since,  when  A.=  1,  we  assume  the  duplicated  head  and  its 
associated  element  block  function  as  a  single  block,  we  define  e9  to 
be  0  when  A^=  1.  This  leaves  us  with  two  possibilities:  either  the 
airings  of  the  SSM  are  explicit  rings  with  forward  pointers  (if  0^=  1) 
or  the  element  blocks  are  stacked  upon  the  head  (if  0^=  0).  In  the 
first  case  e9  will  clearly  be  equal  to  L  and  in  the  second  case  to  s„ 
Recalling  that  s^<  L<  In,  we  see  that  even  if  0!=  1,  choosing  to 
follow  a  forward  pointer  or  to  step  through  the  stack,  as  the  case 
may  be,  is  still  preferable  to  following  the  head  pointer. 

Thus,  e9  will  be  given  by  the  following  expressions: 

e9  =  (0X  +  0-sp  for  ie  jl,  2,  •  •  •  ,10} 
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Finally,  let  e*  represent  the  number  of  time  units  required  to 
move  from  an  arbitrary  element  block  of  an  a.-ring  of  the  SSM  to  the 
head  of  that  ring,  where  U.  {1, 2,  *  *  *  10}. 

We  should  discuss  for  a  moment  what  we  mean  by  "an  arbitrary 
element  block".  In  tracing  through  a  storage  structure  we  may  enter 
a  ring  via  any  of  its  element  blocks  and  desire  to  move  to  its  head. 

If  there  are  k  element  blocks  in  the  ring  and  if  we  are  given  no  a 
priori  information  (which  we  assume  we  are  not),  then  we  are  equally 
likely  to  enter  the  ring  at  any  of  the  k  element  blocks.  That  is,  the 
probability  that  we  enter  the  ring  via  the  j-th  element  block  is  1A  for 
all  je  {1, 2,  •  *  *  ,k},  where  we  assume  that  the  element  blocks  of  the 
ring  are  numbered  consecutively  from  1  to  k  in  the  order  in  which 
they  appear  in  the  ring.  The  number  of  the  element  block  via  which 
we  expect  to  enter  the  ring  is  then  given  by  the  expectation  of  j,  which 
we  denote  E[  j]  . 


E[j] 


j 


=  1  k(k+l) 

H - 2~ 

k+1 
‘  2 
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As  a  matter  of  notational  convenience  let  us  define 


When  we  refer  to  "an  arbitrary  element  block",  it  is  this  block  -  the 

A 

k-th  element  block  -  to  which  we  refer. 

Let  us  now  consider  the  amount  of  time  required  to  move  from 

A 

the  k.-th  element  block  of  an  airing  of  the  SSM  to  the  head  of  that 
ring,  where  ie {1,2,- ••  ,10}. 

As  is  the  case  with  e?,  if  A  -  =  1 ,  we  define  e^  to  be  0. 

If  0!  =  1,  very  clearly  we  need  only  follow  the  head  pointer  in  the 
element  block  to  reach  the  head.  Then  j  .  =  h,. . 

A 

On  the  other  hand,  if  0?  =  0,  we  must  move  from  the  L-th  element 
block  through  all  intervening  element  blocks  in  order  to  reach  the 

A 

head.  We  know,  however,  that  moving  from  the  k.-th  block  to  the 
A 

k^-l-st  block  requires  e^  units  of  time,  as  does  moving  from  the 

A  A 

k^-l-st  block  to  the  k.-2-nd  block,  and  so  on  through  the  2-nd  block 
to  the  1-st  block.  Similarly,  we  know  chat  moving  from  the  1-st 
block  (the  block  nearest  the  head)  to  the  head  requires  tV  units  of 
time.  Thus,  a  total  of 

e?  +  (k.-l)  e. 
i  'ii 

A 

units  of  time  are  required  to  move  from  the  k.-th  element  block  to 


the  head. 
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Therefore,  e*  may  be  characterized  by  the  following  expression: 
e*.  =  A  {0!  h.  +  ?.  [  e°  +  ( k.-  1)  e.]  } 
for  ic {1, 2,*  •  '10}. 

4.  2. 3  Sub-procedures  and  Their  Time  Costs 

Upon  closer  inspection  of  our  definitions  for  the  primitive  opera¬ 
tions  and  their  methods  of  implementation,  we  see  that  each  method 
consists  of  a  number  of  sub-procedures,  many  of  which  are  common 
to  several  different  methods. 

For  instance,  one  of  these  sub-procedures  might  be  that  se¬ 
quence  of  steps  required  to  search  the  SSM  for  all  occurences  of 
a  particular  relation  symbol  associated  with  a  given  source.  Another 
might  be  that  sequence  of  steps  required  to  access  all  targets  asso¬ 
ciated  with  a  given  source/relation  s’  mbol  pair. 

Since  sub-procedures  such  at  these  appear  several  times  in  the 
definitions  of  the  various  methods  and  since  our  ultimate  goal  is  the 
determination  of  the  time  cost  for  each  method,  it  will  be  advanta¬ 
geous  to  develop  time  costs  for  the  sub- procedures. 

If  we  examine  in  detail  the  procedures  used  to  implement  the 

methods  for  the  various  primitive  operations  (as  given  in  Table  4-2 
and  Appendix  A),  we  see  that  they  involve  twenty  different  sub-pro¬ 
cedures  in  two  classes.  The  first  class  contains  ten  sub-procedures 

of  ♦he  form  "Determine  al^  x1  associated  with  x,  ’,  and  the  second 
class  contains  ten  sub-procedures  of  the  iorm  "Search  for  a  parti¬ 
cular  x,  associated  with  x0".  Let  a.  denote  the  i-th  sub-procedure 

-  1  ^  15?i 


in  the  first  class  and  let  a* denote  the  i-th  sub-procedure  in  the 
second  class,  where  ie  {l,  2, . . .  10}. 

Using  the  terminology  of  the  SSM  (where  we  refe-  to  parti¬ 
cular  blocks)  as  opposed  to  the  terminology  of  the  DSM  (where  we 
refer  to  sources,  targets,  and  relation  symbols),  let  us  define 

first  the  sub-procedures  £*•  for  ie  { 1 , 2,  •  •  • ,  10j . 

I.et  <y  represent  the  sequence  of  steps  required  to  access  all 

b.- blocks  associated  with  a  given  b^-block.  Similarly,  let 

represent  the  sequence  of  steps  requited  to  access  all  b^ -blocks 

associated  with  a  given  bg-block.  We  note  that  the  effects  of  and 

<*2  are  identical.  That  is,  the  same  bj -blocks  will  be  acce.  sed  by 

both  and  a 2  (assuming  of  course,  that  the  given  bg-block  and  the 

given  b^-block  are  associated  with  one  another).  The  time  costs  of 

these  two  sub-procedures  will  in  general  be  different,  however. 

Let  a ^  represent  the  sequence  of  steps  required  to  access  all 

b^ -blocks  associated  with  a  given  b- -block,  and  let  o ^  represent  the 

sequence  of  steps  required  to  access  all  b^ -blocks  associated  with 

a  given  bg-  lock.  Clearly,  a g  and  a ^  are  the  direct  analogies  of  aj 

and  a  ,  respectively,  for  tracing  a  storage  structure  in  the  other 
z 

direction. 

Let  c*p.  andffg  represent  those  sequences  of  steps  required  to 
access  all  -blocks  and  all  b^-bloeko,  respectively,  associated 
with  all  t^-blocks  which  represen*  *he  same  relation  symbol. 

Let  and  repiese^r  th.»se  sequences  of  steps  required  *o 


access  all  bg -blocks  associated  with  the  b^-block(s)  representing  a 
given  source  and  to  access  all  b,^  blocks  associated  with  the  b^- 
block(s)  representing  a  given  target,  respectively. 

Finally,  let  ctg  and  represent  those  sequences  of  steps 
required  to  access  all  b^-blccks  associated  with  the  b^-block(s) 
representing  a  given  source  and  to  access  all  b^-blocks  associated 
with  the  b^-block(s)  representing  a  given  target,  respectively. 

A  summary  of  these  sub-pr  icedure  descriptions  appears  in 
Table  4-3. 

Consider  now  the  sub-procedures  a’t  for  ie  {1, 2,  •  *  •  10}. 

Instead  of  accessing  all  of  the  blocks  of  a  given  type  which  are 
associated  with  some  block  or  blocks,  we  may  wish  to  access 
only  enough  of  these  blocks  to  find  a  particular  one.  If  there  are 
k  blocks  of  a  given  type  associated  with  some  block  of  another 
type,  we  would  expect  that  in  order  to  access  one  particular  block 

A 

of  the  k  blocks  we  must  access  on  the  average  k  blocks,  the  last 
one  accessed  being  the  block  sought.  An  example  of  this  mignt  be 
looking  for  the  bg-block  which  represents  a  particular  relation 
symbol  and  which  is  associated  "'<th  a  particular  b^-block. 

Clearly,  we  can  establish  a  one-to-one  correspondence 
between  the  sub-procedures  a ^  and  the  sub-procedures  ap  Thus, 
by  substituting  the  phrase  "a  particular"  for  the  word  "all",  we 
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f 


Sub-procedure  a  represents  the  sequence  ox  steps  required  to 
access  all  x.  associated  with  xrt. 

l  i 


a 

■  •  1  . . 

X1 

X2 

al 

b^  -blocks 

a  given  b^- block 

a2 

bj -blocks 

a  given  bg-block 

q3 

b^ -blocks 

a  given  b,. -block 

a4 

b^  -blocks 

a  given  bg-block 

a5 

bj -blocks 

all  bfi-blocks  which  represent  the 
same  relation  symbol 

a6 

b^ -blocks 

all  bg-blocks  which  represent  the 
same0  relation  symbol 

q7 

bc -blocks 

0 

the  bjblock(s)  which  represent  a 
given  source 

a8 

b^-blocks 

the  b^  -block(s)  which  represent  a 
given  Target 

a9 

b^ -blocks 

the  bt  -block(s)  which  represent  a 
given  source 

a10 

bj -blocks 

the  b^-block(s)  which  represent  a 
given  target 

Table  4-3.  Summary  of  Sub- Procedure  Descriptions 
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may  define  sub-procedure  with  the  same  statement  used  to  define 
sub-procedure  ai  for  ie  {i,  2,  • . . ,  10} .  Also,  Table  4-3  may  be 
considered  a  summary  of  the  sub-procedures  af  if  "all"  is  re¬ 
placed  by  "a  particular"  in  the  statement  "Sub-procedure  a  repre¬ 
sents  the  sequence  of  steps  required  to  access  all  x^  associated 
with  X2." 

Let  the  functions  z.(t)  and  z*.  (t)  represent  the  number  of 
time  units  required  to  perform  a.  and  a*.,  respectively,  upon  the 
storage  structure  represented  by  the  SSM,  where  ie  { 1 , 2 ,  • . . ,  10}. 

The  variable  t,  which  is  used  as  an  argument  to  the  functions 
and  z1^ ,  represents  the  numbei  of  time  units  required  to  perform 
some  operation  upon  each  of  the  blocks  accessed  by  ai  and  a*,  as 
the  case  may  be. 

The  operations  to  which  t  refers  may  be  divided  into  two  classes: 
the  first  class  contains  those  operations  which  compare  the  contents 
of  some  field  of  a  block  with  some  given  quantity,  indicating  whether 
or  not  a  match  is  made  and  the  second  class  contains  those  operations 
which  fetch  (and  record  or  display)  the  contents  of  some  field  of  a  block. 

Before  considering  the  operations  in  each  of  these  two  classes. 


let  us  define  a  number  of  basic  quantities  much  as  we  did  for  L ,  s^, 
and  h..  Let  f  represent  the  number  of  time  units  required  to  follow 

1  cl 

a  description  block  indicator  from  a  b^ -block  of  the  SSM  to  the 

corresponding  description  block,  (f  is  of  importance  only  if  a.  -  1, 

a  l 


of  course.)  Similarly,  let  f  represent  the  number  of  time  units 
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required  to  follow  a  description  block  indicator  from  a  -block  of 
the  SSM  to  the  corresponding  description  block. 


Let  F  ,  F  ,  and  F  represent  the  number  of  time  units  required 
a  r’  p 

to  follow  a  pointer  in  a  source  ring,  a  relation  ring,  and  a  target  ring, 
respectively. 

Let  c^  represent  the  number  of  time  units  required  to  compare 
a  data  item  description  (as  it  appears  in  a  description  block)  with  a 
known  or  given  quantity.  Similarly,  let  cr  represent  the  number  of 
time  units  required  to  compare  a  relation  symbol  name  with  a  given 
quantity. 

Finally,  let  v^  and  vr  represent  the  number  of  time  units 
required  to  fetch  a  data  item  description  and  a  relation  symbol 
name,  respectively. 

Return  now  to  consideration  of  the  two  classes  of  operations.  In 

the  first  class  we  will  define  five  operations  which  will  require  the 

following  respective  numbers  of  time  units  to  perform:  C  ,  C  ,  C  , 

3.  r  ^  r 

C  and  C  .  In  the  second  class  we  will  also  define  five  operations 
1 2  p 

which  will  require  V  ,  V  ,  V  ,  V  ,  and  V  units  of  time  to  perform, 

a  rl  r  r2  P 

respectively. 

C  and  V  represent  the  number  of  time  units  required  to  compare 

CL  d 

against  a  given  quantity  and  to  fetch,  respectively,  the  data  item 
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description  associated  with  a  given  fc^- block  of  the  SSM.  If  cr^=  0, 
the  description  block  associated  with  a  given  b^  -block  is  attached 
directly  to  the  b^ -block.  In  this  case  Ca=cd  and  Va=  v^.  On  the 
other  hand  if  cr^=  1,  we  must  follow  a  description  block  indicator  in 
order  to  reach  the  description  block  before  we  can  compare  or  record 
the  description.  Thus,  in  this  case  Ca=  fg+c^  and  Va=  fa+v^.  We  may 
then  characterize  C  and  V„  by  the  following  expressions: 


C  =a.  f  +  c  , 
ala  d 


V  =Oi  f  +vH 
ala  d 


Cp  and  Vp  represent  quantities  analogous  to  Ca 
b^ -blocks  of  the  SSM.  It  follows  that 


arid  V,  for  the 

a. 


p=  °2  fp 

=  ff0f 
P  2  p 


+  Cd 
+  Vd 


Cr  and  Vr  represent  the  number  of  time  units  required  to  compare 
against  a  given  quantity  and  to  fetch,  respectively,  the  relation 
symbol  name  associated  with  a  given  bg-block  of  the  SSM.  If  1, 
the  bg-block  contains  a  relation  symbol  name  field  so  that  Cr=  cr  and 
Vr=vr.  If  on  the  other  hand  p9=  0,  we  have  two  choices:.  (1)  if  p^  or 
p  is  1,  go  to  one  of  the  br-blocks  or  b_-biocks  associated  with  the 

O  i)  I 

bg-blocks  and  containing  a  relation  symbol  name  iield,  or  (2)  go  to 
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the  head  of  the  relation  ring  passing  through  the  bg-block  (which  we 
assume  contains  a  relation  symbol  name  field).  It  so  happens  that 
Cr  and  Vr  will  be  used  only  when  p^  =  0  or  p^=  0,  however.  In  parti¬ 
cular,  they  are  used  only  in  conjunction  with  the  sub-procedures  a 
c*g,  a *^,  a*g,  which  are  used  to  move  toward  the  "center"  of  the  struc¬ 
ture  in  search  of  one  or  more  relation  symbols.  These  sub-procedures 
first  encounter  either  the  4. -blocks  or  the  b,- -blocks.  If  these  first 
blocks  encountered  do  not  contain  a  relation  symbol  name  field,  then 
the  corresponding  bg-  block  must  be  accessed,  which  brings  us  to  the 
point  of  this  discussion.  Let  us  assume  that  and  p^  must  be  equal 
when  p£  =  0.  This  implies  that  if  only  one  type  of  block  (of 
the  bg -blocks,  bg-blocks,  and  b^- blocks)  contains  a  relation 
symbol  name  field,  then  the  bg-blocks  must  contain  that  field 
(i.e.,  then  p^  =0,  P2  =1,  and  p9  =0).  This  is  a  rather 
arbitrary  assumption  which  we  motivate  simply  on  the  grounds 
of  convenience  in  deriving  the  related  expressions.  It  does, 
however,  have  a  certain  intuitive  appeal  in  that  it  preserves 
some  of  the  symmetry  of  the  model.  This  assumption, 
coupled  with  the  fact  that  C'r  and  will  used  only  when 
p^  =  0  or  pg  =0,  rules  out  the  first  alternative  above. 
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To  reach  the  head  of  a  relation  ring,  as  suggested  by  the 

A 

second  alternative,  we  must  follow  mr  pointers.  (Since 

there  are  on  the  average  m  b.-blocks  which  represent  the 

r  b 

same  relation  symbol,  it  follows  that  there  are  mr  bg-blocks 
in  a  given  relation  ring.  We  assume  that  we  enter  this 
ring  via  an  arbitrary  bg-block  in  the  ring.  Hence,  we 

A 

must  follow  m^  pointers  to  reach  the  head.)  It  follows 

A  A 

that  C  =  m  F  +  c  and  V  =  m  F  +  v  .  We  may  then 
rrrr  rrrr  J 

characterize  Cr  and  Vr  by  the  following  expressions: 


rn  F  +  c 
r  r 


m  F  +  v 
r  r 


r 

r 


i"  and  V  represent  the  number  of  time  units  required  to 
A1  rl  • 

compare  against  a  given  quantity  and  to  fetch,  respectively,  the 

relation  symbol  nam  ■  associated  with  a  given  bg-block  of  the  SSM. 

If  Pi  =  1>  clearly  C  =  c  and  V  =  v  .  If  p-  =  0,  however,  we  will 
i  i  r  i 

move  to  the  bg-block  associated  with  the  given  bg-block  and  apply 
the  operaaons  which  give  rise  to  C  and  Vr-  We  know  that  the 
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amount  of  time  required  to  move  from  a  b,. -block  to  the  corresponding 

b„-block  is  given  by  e*_ .  Therefore ,  in  this  case  C  =  e*  +  C  and 
6  5  x-j  5  r 

V  =  e*  +  V  .  C  and  V  are  then  described  by  the  following 
rl  5  r  rl  rl 
expressions: 


Crj  cr  +  pl  (e5  +  Cr> 
Vr,  vr+^l  <e5  +  Vr> 


C  and  V  represent  quantities  analogous  to  C  and  V  for 
r2  r2  rl  rl 

the  b^-blocks  of  the  SSM.  It  follows  that 


"  p3  cr  +  p3  *e*6  +  Cr' 
=  p3  vr  +  P3  <e*6  +  vr) 


It  should  be  clear  that  only  certain  of  the  quantities  C  ,  C  ,  C  , 

a  r  j  r 

C  ,  C  ,  V  ,  V  ,V,V  ,  and  V  may  be  substituted  for  t  in  z.(t) 

r2  p  a  rj  r  r2  p  l 

and  zt  (t)  for  a  given  value  of  ie {1,2, ♦  •  • ,  10} .  For  example,  only 

C  and  V  apply,  when  i=  1. 
a  a 

Let  us  continue  now  with  our  consideration  of  the  functions  z^t) 

and  zt{t)  by  intially  considering  z.(t)  for  ie  {l,  2,  •  • . ,  10} . 

Consider  first  z^(t),  whict  represents  the  number  of  time  units 

required  to  access  all  b^ -blocks  associated  with  a  given  b^ -block.  If 

we  wish  to  compare  with  a  given  quantity  the  data  item  description 

associated  with  each  b, -block  accessed,  then  t  =  C  .  On  the  other 

1  a 


106 


wait! 


V 


hand,  if  we  wish  to  fetch  the  data  it.em  description  associated 
with  each  b. -block  accessed,  then  t  =V„.  In  either  event 
the  steps  required  to  access  the  b^ -blocks  will  be  the  same. 

We  may  outline  these  steps  as  follows: 

(1)  Move  to  the  bg -block  associated  with  the  given  b^ -block. 

(2)  Access  all  b^-blocks  associated  with  that  bg-block. 

(3)  Move  to  the  b^ -block  associated  with  each  b,. -block. 

(4)  Access  all  b^ -blocks  associated  with  each  b^-block. 

(5)  Move  to  the  b2~block  associated  with  each  b^-block. 

(6)  Access  all  b^ -blocks  associated  with  each  bg-block. 


Of  course,  depending  upon  what  transformations  have  been  applied  to 
the  SSM,  some  of  these  steps  may  net  have  to  oe  performed  (i.e. , 
the  time  required  to  perform  the  steps  may  be  zero). 

Let  us  examine  the  number  of  time  units  required  to  perform 
each  of  these  six  steps.  Step  (1)  will  clearly  require  e*g  units  ot  time 
to  perform. 

To  perform  step  (2)  we  must  move  from  the  b^-block  to  the  first 
b^-block  of  the  a^-ring  and  then  move  sequentially  through  the 
remaining  K^-l  bg-blocks  of  the  ring.  This  will  require  e°  +  e|. 

units  of  time.  We  have  assumed  that  the  last  element  block  of  the 
a^-ring  is  treated  exactly  like  the  other  element  blocks  in  that  we  try 
to  access  the  element  block  following  it.  Of  course,  there  is  no  such 
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block,  but  we  assume  the  time  required  to  determine  this  fact  is 
given  by  e^.  Hence,  we  multiply  e|.  by  Kgg  instead  of  by  Kgg-1. 

The  performance  of  step  (3)  normally  requires  e*4  units  of  time 

and  since  it  must  be  performed  once  for  every  b^-block  associated 

* 

with  the  given  b^-block,  the  total  time  required  is  Kgg  e  4  units. 

Step  (4)  is  similar  to  step  (2)  and  we  determine  that  it 
requires  e°  +  e^  units  of  time.  Since  it  also  must  be  performed 
once  for  every  b,--block,  the  total  time  required  is  K0„(e°  +  K00e’). 

Note,  however,  that  if  0g=O,  04=O,  0&=O,  andA5=0(A4=l  in 
this  case),  then  steps  (3)  and  (4)  are  included  in  step  (2).  If  both 
Ag=  1  and  A4=  1,  this  is  only  of  academic  interest  since  e*4  =  0, 
e®  =  0,  and  e^  =  0  anyway,  but  if  Ay=  0,  it  is  of  some  importance. 

Given  that  0„  =  0,  04=O,  0^=0,  Ag=0,  A4=l,  andA^=0,  we  know 
that  bg -blocks  are  stacked  upon  their  respective  b4-blocks,  b4-blocks 
are  duplicated  upon  their  associated  b,. -blocks,  and  b_ -blocks  are  in 
turn  stacked  upon  their  respective  bg-blocks.  Thus,  intervening 
between  each  pair  of  b,, -blocks  in  a  stack  is  a  stack  of  b„ -blocks 
(which  may,  depending  upon  the  values  assigned  to  other  decision 
variables,  have  stacks  of  blocks  between  pairs  of  them).  It  follows 
that  in  moving  from  one  b^-block  in  a  stack  to  the  succeeding  bg-block, 
we  must  step  through  each  of  the  intervening  bg -blocks.  (Because  we 
are  moving  away  from  the  heads  of  the  stacks,  we  may  not  take 
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advantage  c'  any  head  pointers,  of  course.)  Clearly  then,  step  (2) 
encompasses  steps  (3)  and  (4)  and  the  time  costs  of  steps  (3)  and  (4) 


may  be  disregarded.  This  also  justifies  our  assumption  that  e' 

D 

represents  the  time  required  to  determine  that  the  last  b^-block  in 

a  given  a^-ring  is  indeed  the  last. 

* 

Step  (5)  requires  e  2  units  cf  time  and  must  be  performed  once  for 
every  bg-block  associated  with  '.he  given  b^-block,  or  X32  times. 
Thus,  its  total  required  time  is,  e*2  units. 

Step  (6)  is  similar  to  steps  (2)  and  (4).  Its  total  required  time 
is  K32  <e°  ♦  Kn  ej). 

Note  that  if  <2^=0,  02=O,  03  =  U,  and  A3=0,  1Iien  stePs  (5)  311(1  (6) 
are  included  in  step  (4).  This  situation  parallels  that  for  the  inclu¬ 
sion  of  steps  (3)  and  (4)  in  step  (2)  exactly.  By  extending  our 
reasoning,  we  c?n  in  fact  show  that  if  steps  (5)  and  (6)  are  included 
in  step  (4)  and  J  step  (4)  is  in  turn  included  in  step  (2),  then  steps 
(5)  and  (6)  are  included  in  step  (2) . 

Finally,  for  each  b^ -block  accessed  we  perform  that  operation 
which  is  characterized  by  t. 

Before  summarizing  z^(t)  to  this  point,  let  us  define  a  binary - 
valued  inclusion  variable  X.  as  follows: 

Xi=0i-1+  0i+  0i+l+Ai-rl  f0r  iei2>4>6>8} 

xi  =0i-i  +  0i  +  0ui  +  Ai-i  for  iei3>5>7>9) 
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(Recall  our  assumption  that  "+"  indicates  disjunction  when  applied  to 
decision  variables.) 

We  may  now  write  the  following  expression  for  z^(t): 

zl<‘>  =e*6  +  e°5  +  K33e5 

+  X4[  K33e  4  +  K33(e“  +  K22e^)] 

+  X4^  K32e*2  +  K32*el+  Kllel^ 

+  K31  * 

If  we  rearrange  the  terms  of  this  expression  somewhat,  we  can 
obtain  the  expression 

Zl(t)  =  e*6  +  e°5  +  K33  [  ej.  +  X4(e  4  +  e°)] 

+  K32  f  X4e3  +  X2  2+e  1' - 
+  K31  [  X2eJ  +  t] 

or  alternatively 

zl(t)  =  e*6  +  e°5  +  V  e5+X4(e*4 +3°3>! 

+  k3VX4e3*X2(e2  +  e°l>J 
+  kik3k5[  x2ei  + 


This  expression  is  not  quite  complete,  however,  for  we  have 


ignored  so  far  quantities  such  as  the  number  of  time  units  required 
to  move  from  a  pointer  field  for  one  ring  to  a  pointer  field  for  another 
ring  passing  through  the  same  block  (when  such  a  situation  exists) . 

We  assume  that  when  is  to  be  performed,  the  position  indicator 
is  pointing  to  the  relation  symbol  name  field  of  a  given  b^ -block.  (If 
the  b,, -blocks  do  not  contain  a  relation  symbo1  name  field,  will  not 
be  used.)  If  0^-  1  or  0j.=  1,  we  must  step  from  the  relation  symbol 
name  field  to  either  the  head  pointer  field  or  the  forward  pointer 
field  of  the  given  b^-block  in  order  to  allow  access  to  the  correspond- 
ing  bg-block.  As  we  discussed  earlier,  this  will  require  sQ  units  of 
time.  Suppose  instead  that  Ag=  1  and  ek=  1.  In  this  case  we  must 
step  from  the  relation  symbol  name  field  of  the  b^-block  to  the  ai¬ 
ring  forward  pointer  field  of  the  corresponding  bg-block.  (Recall 
that  e  .  is  defined  to  be  zero  for  Ac  =  1  )  rhis  will  also  require  s 
units  of  time.  To  carry  this  one  m°~e  step,  suppose  Ag=  1,  Ag=  1, 
and  04=  1  or  0^  =  1.  Here  we  must  step  from  the  relation  symbol 
name  field  of  the  b^-block  to  either  the  a^-ring  head  pointer  field 
or  the  a^-ring  forward  pointer  field  of  the  corresponding  bg-block. 
There  is,  of  course,  only  one  bg-block  associated  with  the  given 
b^-block  in  this  case.  Also,  e*g  =  0,  eg=  0,  and  e^.  0.  As  before, 
this  move  requires  sQ  time  units.  Continuing  our  example  in  this 
manner  we  reach  the  point  where  Ag-  1 ,  Ag=  1,  A^=  1,  1,  Ag^  1, 
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and  A^=  1.  Clearly,  there  is  exactly  one  bj -block  associated  with 
the  given  b^ -block.  Furthermore,  e*g=0,  e°  =  0,  eg=0>  e*^=0,  e3=^> 
eg=  0,  e*2=  0,  e°  =  0,  and  e^=  0.  We  must  in  this  case  step  from  the 
relation  symbol  name  field  of  the  b^ -block  to  either  the  description 
block  indicator  or  the  description  block  itself,  as  the  case  may  be. 
This  also  requires  sq  time  units.  Thus,  whenever  the  following 
expression  has  a  value  of  1,  we  must  step  from  the  relation  symbol 
name  field  of  the  given  b^-block  to  some  other  field  in  the  structure 
a  move  which  requires  sq  time  units: 

<V  <#6>V  VsV  V5(V  *4>v  V5V3 

V  A„A5A4A,(02V  0'2)v  A6A5A4A3A20x 

V  A6A5A4A3A2A1 

(We  have  used  "V"  instead  of  to  denote  disjunction  here  because 
we  will  later  want  to  arithmetically  sum  the  values  of  expressions 
such  as  this.) 

Suppose  now  that  0^-1  and  0^=1.  If  0^=  0,  we  must  follow  the 
a^-ring  forward  pointers  to  reach  the  b^-block  associated  with  the 
given  b^-block.  In  addition  we  are  to  follow  the  a^-ring  forward 
pointers  to  access  all  b^-blocks  associated  with  that  bg-block.  Hence, 
at  the  bg-block  we  must  step  from  the  a^-ring  forward  pointer  field 
to  the  a^-ring  forward  pointer  field.  This  move,  as  we  know,  requires 
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sq  time  units.  If  0j,  =  1,  we  can  follow  the  ag-ring  head  pointer 
directly  to  the  ag-ring  forward  pointer  without  incurring  the  addi¬ 
tional  sQ  units  of  time.  Suppose  next  that  0g=  1,  0g=O,  Ag=  1,  and 
04=  1  or  0^=  1 .  In  this  case  we  must  step  from  the  ag-ring  forward 
pointer  to  either  the  a^-ring  head  pointer  or  the  airing  forward 
pointer.  Again,  sq  units  of  time  are  required.  If  we  continue  with 
this  line  of  reasoning,  we  can  obtain  another  expression 

V605V  WVV  04)v0606A5A403 

V  0606A5A4A3(02V02)V  0606A5A4A3A201 

which,  when  its  value  is  1,  implies  a  step  requiring  sq  units  of  time. 

Using  the  same  type  of  reasoning  for  cases  for  which  0g=  1,  we 
can  develop  the  expression 

®5(04V0i)V06A403V05A4A3  (02V0^l 

V  0gA4A3A201V  0gA4A3A2A1 

In  this  case,  however,  when  the  value  of  the  expression  is  1,  i:_  steps 
of  s  time  units  are  required.  This  follows,  of  course,  from  the  fact 
that  the  step  must  be  performed  for  each  of  the  kr  b. -blocks 
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associated  with  the  given  b^-block. 

If  we  carry  on  in  this  manner,  we  can  develop  several  more 
expressions  of  this  type.  Let  us  consolidate  all  of  these  expressions 
and  their  respective  multipliers  into  a  single  expression  which  we 
will  designate  z^  as  follows: 

z°=  {((0gV0g)  £005^6^(0^  ppv*  •  •  V  AgAgA^gAgAi) 

+< *6*6  V  W»(1 V  04> V  WsA403V  • ' ' V  V<S V4V2V 


+k5[(05(04V0pV05A403V05A4Aj(02V0yv...V05A4A342A1) 

+(04’?4(‘3V04?4A3(«i2V0yv040^A3A201V040^A3A2A1)] 

+k3k5^(,3*02V02*V'i3  '  2riV  03A2Al' 

+(020^V020^i)] 


+klk3k5t  Eo 

In  this  expression  'V  indicates  arithmetic  summation. 

z  j  indicates  the  number  of  time  units  by  which  our  current 
expression  for  z^(t)  is  deficient.  The  situation  is  easily  remedied: 
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zt(t)  =  e*6+  e°5  +  kg[  e|.  +  e°)] 

+  k3k5[  +  X2(t  /f  +  e°)] 

+  kik3k5t  X2el  +  1  ]  +  Z°1 

Consider  next  z^(  ),  which  represents  thi  number  of  time  units 
required  to  access  all  -blocks  associated  w  ch  a  given  bg-block. 
The  derivation  of  an  expression  for  Zg(t)  fo.lcws  almost  exactly  the 
derivation  of  the  expression  for  z^(t).  Th<.  only  difference  is  that  for 
z^(i)  there  is  no  need  (obviously)  to  move  from  a  b^-block  to  the 
bg-block.  Thus, 

zV^(05VA5'04V04*VA5A4ft3V  A5A4;3'02Vfi2'V' ' '  VA5A4A3A2A1^ 

+k5f<  W02V05A403V  05A4A3(02V0y  V-  •  •  V05A..A3A2A,, 

+<  ViV  VWV  02>V  040iA3A20lV  0404A3A2A1>] 

+k3k5[  (03(02V0yv03A201V03A2A1) 

+  (V201V02?2A1)] 


s 


o 
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and 


z2(t)=e°5+  k5t  e5  +  X4(CV  e°3>l 

+  k3ksf  X4C3  +  x2(e*2  +  e°l>l 
+  klk3kg[  1 1]  +  z°2 

In  deriving  z° ,  we  have  assumed  that  the  position  indicator  is 
intially  pointing  to  the  relation  symbol  name  field  of  the  given  bg- 
block.  If  pg=  0,  however,  we  may  assume  that  the  indicator  is 
pointing  to  the  relation  ring  pointer  field.  (Recall  that  the  relation 
ring  is  used  to  determine  the  relation  symbol  for  a  given  bg-block 
when  pg=  0.)  This  does  not  affect  the  validity  of  our  expression 
for  z° . 

Since  a 3  and  a ^  are  direct  analogies  of  and  a 2,  respectively, 
it  follows  that  z3(t)  and  z^(t)  parallel  exactly  z^t)  and  z2(t),  respec¬ 
tively.  The  actual  expressions  for  z3(t)  and  z^(t)  may  be  found  in 
Appendix  C  which  contains  a  complete  summary  of  z.(t)  and  z}n  or 
all  M{1,2,  * '  * ,  10} . 

Next  consider  the  derivation  of  an  expression  for  z,-(t),  which 
represents  the  number  of  time  units  required  to  access  all  b^ -blocks 
associated  with  all  bg-blocks  which  represent  the  same  relation 
symbol.  It  should  be  clear  that  can  be  performed  by  accessing 
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all  the  bg-block  in  a  given  relation  ring  and  then  applying  c?2  t0  each 
of  those  bg-blocks. 

Since  there  are  rnr  bg-blocks  in  a  given  relation  ring,  the  number 
of  time  units  required  to  access  these  bg-blocks  may  be  given  by 
(m  ,)  F  .  m  F  time  units  are  required  to  access  the  blocks,  and 
Fr  time  units  are  required  to  follow  the  pointer  from  the  last  bg-block 
in  the  relation  ring  to  the  head,  which  indicates  that  all  the  bg-blocks 
have  been  considered. 

It  follows  that 

z5(t)  =  (mr+1)  Fr  +  mrz2(t) 

Although  we  have  no  explicit  need  for  z°  ,  we  know  that  z°  =  m  z° . 

Zg(t)  is  analogous  to  z^(t)  and,  therefore,  will  not  be  considered 
in  detail  here.  An  expression  for  Zg(t)  appears  in  Appendix  C. 

Next  in  line  for  consideration  is  z^(t),  which  represents  the  number 
of  time  units  reouired  to  access  all  bg-b’ocks  associated  with  the 
bj-’olock(s)  which  represent  a  given  course.  By  applying  arguments 
similar  to  those  used  above,  we  can  obtain  the  following  expression: 

z?(t)  =  (ma+l)  Fa  +  ma  {e*  +  e  ° 

+  k2[  e2  +  x3<e  3  +  e4>] 

+  k2kJX3e4  +  4  ]  }  +  z°7 
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We  have  assumed  here  that  a  source  ring  is  used  to  access  the 
bj -block  or  b^ -blocks  representing  a  given  source.  If  we  should 
desire  to  assume  that  the  locations  of  these  b^ -blocks  are  known  a 
priori  (perhaps  as  the  result  of  some  other  operation),  we  may  set 
F  to  zero  and  still  use  the  given  expression. 

a 

Derivation  of  an  expression  for  z°  proceeds  along  lines  similar 
to  the  derivations  for  z° ,  z° ,  etc. ,  but  with  some  distinct  differences. 
We  assume  that  when  we  initially  enter  a  b^ -block  the  position  indi¬ 
cator  points  to  the  source  ring  pointer  fieid  of  that  block.  Our 
immediate  goal  is  to  trace  through  the  storage  structure  to  access 
all  bg-blocks  associated  with  the  given  -block  .  For  each  b^-block 
accessed,  either  we  are  to  compare  the  relation  symbol  which  it 
represents  with  some  known  quantity  or  we  are  to  fetch 
that  relation  symbol.  In  either  event  we  must  determine  what  the 
relation  symbol  is.  If  =  1 ,  we  may  determine  what  the  relation 
symbol  is  simply  by  accessing  the  bg-block.  On  the  other  hand 
if  p1  =  0  ,  we  must  access  the  corresponding  b„-bloci..  As  a  result, 

x  D 

our  expression  for  z°  will  appear  as  follows: 

z7  =  ma  «(*ilV'4i)VAl<,2VilA2(03V03)VAlA2A304 

V  ^1^2A3A4(plV05V0yVAl  A2A3,V4A5^1) 

+  (010102V  01 0i  A2(03V  03> V  01 01 A2A304 

V  01 0i  A2  A3  A4<V  V  05> V  01 01 A2  A3A4  Vl> 
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+  H  (02(03V  <9V  02A304V  02A3A4AV  05V  *5)V  Vs^Vl* 
*<030304V  03^A4(PjV  05V  0^) V030' a4a5Pj)] 

*k2k4  I  <04(P;V  V04A50l'  ■>•  }  s0 


To  clarify  the  intent  of  this  expression  let  us  briefly  examine 
the  terms 


A1^2^3A4^p1V05'/^VA1A2A3A4A5a1 

which  apply  when  A1  =  1,A^=<  A0~  1,  ana  A,=  1.  If  p.  =  1,  we  need 

X  £i  9  <J  *  X 

only  step  from  the  source  ring  pointer  field  of  the  given  b^ -block  to 
the  relation  symbol  name  field  of  the  associated  be -block  in  order  to 

J 

be  able  to  determine  what  the  relation  symbol  is.  On  the  other  hand 
if  0,  either  we  must  step  to  the  ac~ring  forward  pointer  feld  or 
head  pointer  field  of  the  d,. -block  (if  or  1)  In  order  to  access 
the  corresponding  b^-bjork  or  v»e  must  step  directly  to  the  relation 
symbol  name  field  or  the  relation  ring  pointer  field  of  the  bg- block 
(if 

Zg(t)  is  analogous  to  2^(1)  and  so  will  not  ba  considered  here  but 
may  be  found  in  Appendix  C. 

Zg(l),  vhich  represents  the  number  of  time  units  required  to 
access  all  b^- blocks  associated  with  the  bj-blocK(s)  which  represent 
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a  given  source,  is  similar  m  all  respects  to  those  z^(t)  considered 
above.  Without  any  difficulty  ve  can  derive  the  following  expression 
to  represent  it: 


z9<t)  =<ma+1,Fa  +  maK  +  e°2 

+  k2  [  +  X3(e*3+eO)] 

+  k2k 4  t  X3e4  +X5  {e  5  +  e°6)l 

+k2k4k6  ^  X5e6  +  X7^eVe8^' 

+  k2k4k6k8  f  X7e8+V6  9  +  e  10^ 

+  [  X0eJo  +  t]  }  +  z° 

where 

z°9  =  ma  {((01Ve>i)VA102 

+( Vl02v  V^2(fl3v  0^)v  •  ■  •  V  v;  a2A3A4A5A6A7A8A9A10) 

+Kl2[<V^V03)V02i3V02A3A4(V0;)V'‘'V0?'i3W6Y8^O) 
+<0303  V  V3 A4(05V05)V- ' '  V0303A4A5A6A1A8A9AlO)l 

+k.3(  (04(05v0')v04a506v04a5a6(07vo^)v.  •  ■  V04a5a6a?a8a9a1o) 
+(O50506V0505A6(O7Vli1*V'  ’  ’  V  °5°5A6A7A8A9A10^ 
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+K14[((S6(<S7v^)V06^V06V8(09V03V’"V06A7A8A9A1O) 

+  U~W  0,^A8(  V  0g)V ' '  -V070^8A9A1O)] 

-K1S[  (W09>VWlOV08A9AlO) 

+<090901OV  0909Alo'l 


+ki6[  V  >  s. 


Finally,  z1Q(t),  which  is  analogous  to  zg(t),  appears  in  Appendix  C. 
Let  us  now  consider  the  functions  z|  (t)  for  {1, 2,  •  •  •  10}. 

Consider  the  derivation  of  an  expression  for  z*  (t).  There  are 
k&  bg-blocks,  Kjkg  b^ -blocks,  and  kjkgkg  b^-blocks  associated  with  a 
given  b^-block.  (To  simplify  notation,  we  will  use  and  in 

place  of  kg,  i^kg,  and  k^kg,  respectively)  Because  of  the  uniform  dis¬ 
tribution  of  blocks  in  the  SSM,  we  can  say  in  general  that  when  we  have 

A 

accessed  Kgj  bj -blocks  associated  with  a  given  b7-block,  we  will  have 

A  A 

accessed  bg-blocks  and  Kggbg-blocks  associated  with  the  b7-block. 

Since  a*  andc^  are  so  closely  related,  one  would  expect  a  strong 
similarity  between  z*  (t)  and  z^t),  and  indeed  there  is.  Using  very 
much  the  same  arguments  we  used  for  z^t),  we  obtain  the  following 
expression  for  z  ^(t); 
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*  lW  =  e*6  +  e°5  +  (V1)  e5  +  ^3  x4(e*4  +  e°3> 

+  ^2”1'X4e3  +  2X2^e  2+e<P 

+  ("Sr1)  Vi +  ‘ +  z°i 

where  z°j  performs  the  same  function  for  z^  (t)  as  z°  does  for  z^(t). 

It  may  be  helpful  to  make  a  comment  or  two  about  this  expression 
Note  that  the  coefficients  of  ej.,  e^,  and  e^  are  of  the  form  k-1,  whereas 
in  the  expression  for  z^(t)  these  coefficients  are  of  the  form  k  (i.e. , 
they  are  not  decremented  by  1).  As  we  indicated  in  the  derivation  of 
the  expression  for  z^(t),  if  we  want  to  access  all  blocks  of  a  given 
type,  then  even  when  we  encounter  the  last  block  we  must  try  to 
access  one  more  in  order  to  determine  that  all  the  blocks  have  been 
considered.  If  there  are  k  such  blocks,  this  means  we  must  perform 
k  access  operations  (not  including  the  operation  required  to  access 
the  first  block  from  the  head),  which  accounts  for  coefficients  of  the 
form  k  in  the  expression  for  Zj(t).  If  on  the  other  hand  we  are  looking 
for  a  particular  block,  then  we  y  stop  accessing  blocks  once  we 
have  encountered  the  one  sought,  This  means  that  access  operations 
need  be  performed  only  for  those  biocxs  which  precede  the  block  sought. 
Thus,  if  the  block  sought  is  the  k-th  block  encountered  (of  a  given  type), 
we  need  perform  only  k-1  access  operations,  which  accounts  for  co- 
efficients  of  the  form  k-1  in  the  expression  for  z^(t). 
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Even  though  in  this  case  we  are  seeking  a  particular  b^ -block, 
this  reasoning  clearly  applies  to  the  bg-blocks  and  the  bg-blocks  at  the 
intermediate  levels,  also.  For  instance,  we  can  view  this  situation 
as  searching  for  the  particular  bg -block  (associated  with  the  given 
b^-block,  of  course)  with  which  the  b^  -block  sought  is  associated 
and  searching  for  the  particular  bg -block  with  which  the  b^  -block 
sought  is  associated. 

At  this  point  we  bring  up  a  note  of  caution:  the  expression  for 
Zj(t)  above  is  correct  as  far  as  it  goes,  but  it  is  incomplete. 

Suppose  for  instance  that  X^=0,  that  is,  e*4,  e°,  and  are 

A 

covered  by  ej..  Of  course,  ej.  applies  only  to  the  first  K^-l 
bg-blocks  encountered,  not  the  last  one. 

This  means  that  e*4  and  e^  are  required  once  to  account  for  the 
bg-blocks  associated  with  the  last  bg-block  encountered,  e^  will  also 
’>e  required  a  number  of  times  (to  be  determined)  to  account  for 

A 

these  bg-blocks.  Normally  (when  X^=  1)  e^  is  required  for 

A 

bg-blocks  but  (K}3_1)  kg  occurrences  of  e^  are  covered  by  e).  when 

A  A 

X^=  0.  Therefore,  we  will  require  (Kg2 
of  ejj  when  X4=  0. 


1)  -  (Kgg  -l)  kg  occurrences 


m 


,K32~V  -  OSs'1’  k3 


k„k,+l  k  +1 

(  g  ~  ^  K3 


k0k_-l  k0k_  -  k„ 

o  0  O  D  O 


k3-l 


A  A 

V 


Similarly,  e  ^  and  are  required  once  and  ej  is  required 

A 

times  when  Xg=  0. 

Taking  these  factors  into  account,  we  obtain  the  following  ex¬ 
pression  for  z^(t): 

z*(t)  =  e*6+e  5  *  +[  +  '1  <e4  +  e°3> 

+  (X4K32+X4K22~1^  6 3  +  f  X2^32  +  ^  ^2  +  el'; 

+  b&i  +  *Ai-l)  ei+hit  +  z°i 


The  expression  for  z0^  foLows  directly  from  the  expression  for 
z^.  All  operations  involved  remain  exactly  the  same;  only  the 


184 


numbers  of  blocks  are  different. 


*T  =  {<^6V  $  V  VsV  A6A5(04V ' 04>V ■  ■  ■ v  A6A5  A4A3  A2A1> 
+  <060605V  0606A5  (04V  04>  V0606A5A403V '  ‘ ' V  0606 


A3l  (05(04V  04>V05A403V  05A4A3(02V  02>V' '  ’ V  05A4A3  Vl> 
*(04^03v  040^3(02V0^  V040^a3  a20jV  040^a3a2a1)] 

Ail  <03(02V02>V03A201V03A2A1> 

+(020201V0202A1>] 

All  hi  H 

The  expressions  for  all  other  z*(t),  and  their  corresponding  z°* 
can  be  obtained  as  above  and  are  summarized  in  Appendix  C. 

4.2.4  Primitive  Operation  Time  Costs 

We  now  have  at  our  disposal  the  information  required  to  return 
to  our  consideration  of  the  time  costs  t..  associated  with  the  methods 
for  implementing  the  various  primitive  operations. 

Let  us  consider  the  derivation  of  an  expression  for  t^ ,  the 
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number  of  time  units  required  to  perform  using  its  first  method. 
Recall  the  basic  steps  required  to  implement  this  method: 

(1)  Find  di  in  the  storage  structure. 

(2)  Examine  all  relation  symbols  associated  with  di  in 

search  of  r.. 

] 

(3)  If  Tj  is  found,  examine  all  targets  in  the  corresponding 
target  set  in  search  of  pfc. 

Step  (1)  amounts  to  finding  the  head  of  the  source  ring  containing 
those  bj^ -blocks  which  represent  dy  We  will  assume  that  finding  the 
head  of  a  source  ring  requires  T  time  units.  (We  can  assume  i/mt 
this  step  is  accomplished  by  using  di  as  a  key  to  a  dictionary,  for 
instance.)  Since  we  will  have  need  of  these  quantities  later,  also 
assume  that  finding  the  head  of  a  relation  ring  or  a  target  ring 
requires  and  T  time  units,  respectively. 

If  we  have  assumed  that  F  =0  (i.  e. ,  that  we  do  not  use  the 
source  ring  to  access  the  b^ -blocks  representing  a  given  source,  but 
rather  we  know  a  priori  where  these  bj -blocks  are  located),  then  we 
may  assume  that  T  =  0,  also. 

Now  consider  steps  (2)  and  (3).  Suppose  -.hat  we  do  not  find  r. 
among  the  relation  symbols  associated  with  d..  This  means  that  we 
will  have  to  examine  all  b^-blocks  associated  with  the  -block(s) 
representing  d.  in  search  of  one  which  represents  the  relation  symbol 
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i-j,  but  without  success.  Of  course,  this  is  simply  sub-procedure 

arj,  which  has  a  time  cost  z^C^  ). 

Suppose  next  that  we  do  find  r.  but  not  pfc.  If  there  is  but  a  single 

copy  of  each  bg -block  associated  with  the  bj  -blocks  representing  d. 

(i.e. ,  if  m  =  1),  the  bR -block  representing  r.  may  be  found  via  a* 

o  J  t 

which  has  a  time  cost  z*  (C  ).  We  will  then  examine  via  a,  all 

i  r^  6 

b^ -blocks  associated  with  the  bg -block  representing  r^  in  search 
of  p  which  will  not  be  found.  The  time  cost  required  for  this  is 

,lv 

z„(C  ).  If  on  the  other  hand  m  >  1  we  must  examine  all  b^-blocks 
p  r^  o 

to  insure  that  we  have  found  all  those  representing  r„  Then  for  each 
bg-block  representing  r.  we  must  perform  a y  The  time  cost  required 
in  this  case  will  be  z^(Cr  )  +  mf  Zg(Cp). 

Finally,  suppose  that  we  find  both  r^  and  p^.  In  this  case  we  may 
stop  accessing  bg-blocks  as  soon  as  p^  is  found.  Thus,  the  time  cost 
required  to  find  r.  will  be  z*  (C  ).  Since  there  art  n  b_ -blocks 

j  I  r^  r^  3 

which  represent  r.  (and  which  are  associated  with  thf  b^-block(s) 

A 

representing  d.),  we  would  expect  p,  to  be  associated  with  the  m  -th 
i  -K  r^ 

be -block  representing  r..  This  means  that  for  this  b,. -block  we  need 
o  ]  o 

examine  only  part  of  the  b^  -blocks  associated  therewith  at  a  time 

*  A 

cost  Zg(C^),  but  for  the  first  mr  -1  bg-blocks  representing  r.  we 
must  examine  all  associated  b..  -blocks  at  a  time  cost  z 0(C  )  each. 

O  p 
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Implicit  in  this  discussion  has  been  the  assumption  that  p^  =  1. 

If  in  fact  pj=  0,  we  may  use  a ^  and  a in  place  r.f  a ^  and  a  g,  re¬ 
spectively.  The  reason  for  this,  of  course,  is  that  the  operation 

characterized  by  C  must  access  the  b„~block  associated  with  a 

ri  6 

given  bg -block  when  =  0,  and  we  may  use  this  bg-block  instead  of  the 
bg- block  as  our  starting  point  in  the  search  forp^.  Clearly,  z^(C^) 
and  z*(C  )  will  then  be  used  in  place  of  z„(Cn'  and  z*  (Cn)  ,  respec¬ 


tively. 


Let  us  now  summarize  the  time  cost  for  each  of  the  three 


situations: 


(1)  r^  is  not  found 

MCr  > 

rl 

(2)  r.  is  found,  butp,  is  not 

j  « 

(Ag+^z^Cr  )+A6A8z7(c  )  +  mr  [PiZ3(c  )+Ptz4(c  )] 


(3)  both  r,  andp,  are  found 
J  K 

P1Z3(CP)+^1Z4(CP)1  +[  P1Z  3(Cp  )+pl4Z  4((p^ 
(Note  that  A„=  0  or  aq  =  0  implies  m  =  1  and  Aft  - 1  and  Aft  =  1  implies 

DO  Tj  O  0 

in  general  that  m  ”>  1.) 

ri 

If  we  assume  for  the  moment  that  the  probability  that  the  first  of 
these  situations  occurs  is  given  by  yp  the  probability  that  the  second 
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occurs  is  given  by  y^,  and  the  probability  that  the  third  occurs  is 

given  by  y0 ,  then  the  time  cost  t,  t  will  be  given  by  the  following 
0  11 

expression; 


‘ll“Ta  +  yl  Z7(Cr,> 


+y2  f(S6+VZVCr.)+A6A8Z7(Cr,)+mr  [plZ3(Cp)+PlZ4(Cp.)]  } 


1  Xl 


+y3{Z7(Cr1)+(mr1"1)f  ^z3(V+plz4(Cp^  +fplZ3(Cp)+Pl Z  4(Cp^  * 

Let  us  now  investigate  the  determination  of  the  probabilities 
y^y^,  and  y3  (along  with  some  other  probabilities  for  which  we  will 
have  use  later). 

Let  us  define  an  event  A  as  "the  relation  symbol  r  is  associated 

with  the  source  d”,  where  re  P  and  de  A.  We  can  determine  the 

probability  that  event  A  is  true,  Pr{A},  as  follovo. 

Associated  with  each  de  A  are  k°  elements  ef  P .  C'ag-'ly,  for 

any  de  A  there  are  then  elements  of  the  k°  elements  of  P  for 
J  2T  4  r 

which  A  will  be  true.  Thus, 

k°±°4 

MA}  — Xi 

Similarly,  associated  with  each  reP  are  mrk^kO  elements  of 

A,  For  any  re  P  %ere  are  then  m  k”k°„  elements  of  the  k°  elements 

1  1  o  8, 
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erf  A  for  which  A  will  be  true.  It  follows  that 


Pr  {A} 


mrklk3 


Recall,  however,  that 


k°A° 

2f  4  a 

k°k°k° 
1  3  r 


Making  this  substitution  for  mr  yields  once  again 


Pr  {A} 


We  can  obtain  this  same  result  in  a  somewhat  different  way. 

Since  IaI  =k°  and  iPl  =k°  ,  tnere  are  k°k°  oossible  (d,r)  -  pairs, 
a  v  7  ar*  ' 

However,  since  there  are  k°k^  elements  of  P associated  with  each 
element  of  A,  only  k^k^k®  (d,r)  -  pairs  actually  cause  event  A  to  be 
true .  Thus, 


Pr  {A}  = 


k°k° 
a  r 


Let  us  define  an  event  B  as  "the  relation  symbol  r  is  associated 
with  the  target  P",  where  rc  Pand  pe  In  a  manner  similar  to  that 
for  Pr{A}  we  can  show  that 
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Pr{B}  = 


mrk°r 
P 

For  instance,  associated  with  each  pe  IT  are  k<^k°  /mr  elements 
of  P.  Since  lPl  -  ,  the  expression  for  PrfB}  follows  directly. 

Let  A*  be  the  event  "the  relation  symbol  r  which  a  given  -block 
(or  bg-block)  represents  is  associated  with  the  source  d",  where 
re  P  and  deA.  Since  there  are  k°k^  sources  associated  with  each 
b_ -block  (or  b„-biock)  and  a  total  of  k°  sources,  it  follows  that 


Pr{A’}  = 


kOkO 


Similarly,  let  B'  be  the  event  "the  relation  symbol  r  which  a 
given  b^-block  (or  bg-block)  represents  is  associated  with  the  target 
p",  where  re  P  and  pell.  Since  there  are  k°k°Q  targets  associated 
with  each  b^ -block  (or  bg-block)  and  a  total  of  k°  targets,  we  obtain 


Pr  {B’}  = 


k°8k°10 


Let  A"  be  the  event  "one  of  the  m  b„-blocks  (of  the  untrans- 

r  6  ' 

P 

formed  SSM)  representing  the  relation  symbol  r  and  associated  with 


191 


the  target p  is  associated  with  the  source  d",  where  de  A,  ret,  a 

pen. 


Associated  with  the  -block  representing  the  source  d  are 
bg-blocks  representing  distinct  relation  symbol.  We  wish  to 
know  the  probability  that  one  of  these  b^-blocks  coincides  with  one 
of  the  mr  bg-blocks  representing  the  relation  symbol  r  and  assc- 

rp  b 

ciated  with  the  b^ -block  representing  the  target  p.  Since  there  are 

m„  b„-blocks  which  may  be  grouped  into  sets  of  mr  ,  we  have  mg/mr 

rp  rp 

sets  from  which  to  choose.  Furthermore,  since  we  have  k<2jk° 
opportunities  for  success, 


Pr  {A”}  = 


k°2-k4 

■Vmr 


k°„k°  m 
2  4  r 


kOko 


p  k°k°k° 


k°vo 
i  l  3 

r«  k° 

p  ka 


Finally,  let  C  be  the  e^ent  "the  source  d  is  associated  with  the 
target  p",  where  de  A  and  pe  n. 

Recall  the-old-ball-in-the-urn  trick:  tor  k  =  0,  1,  -  •  •  ,n,  the 
probability  of  the  event  that  one  will  score  exactly  k  successes 
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(where  a  success  corresponds  to  drawing  a  rec-  ball)  when  one  draws 
withou.  replacement  a  sample  of  size  n  *rom  an  urn  containing  M 
b .Jib.  of  which  m  are  red,  is 


where 


Pr1Ek}=  ,') 


(m)k  <M-ra)n_k 


l  n.  _  _ nl 

k '  k!  k!  (n-k)! 


(M)n  =  M(M-l)  •  •  •  (M-n+1)  and  (M)0  -  1 

For  our  mrposes  here  instead  of  containing  balls,  the  urn  will 

contain  the  sets  of  m  b --blocks  which  represent  a  given  relation 

r  0 
P 

symbol  associated  with  the  -block  representing  a  particular  target. 

The  k°Jc°  /  m  sets  c.f  bg-blocks  which  are  associated  with  the  b^- 
rp 

block  representing  liie  target  P  will  correspond  to  the  red  balls, 
and  our  sample  will  consist  of  the  k^k^  bg-  blocks  representing 
distinct  relation  symbols  and  associated  with  the  b^ -block  represent¬ 
ing  the  source  d„  Then 
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m  = 


k°7k°9 


m 


>■  k°2k°4 


Pr{c}  is  then  the  probability  that  our  sample  of  k°k°  bg-blocks 
will  contain  a  bg-block  from  at  least  one  of  the  k^k^/m^  sets  of 
bg-blocks.  Thus, 


Pr{c}  =  1  -Pr{E0} 

(  n  )  (m)0  (M-m^ 

'  '°  — m 

1  ° c  (M-m-n+1) 

=  1  ‘  M(M-l)*  •  •  (M-n+1) 


Clearly,  the  values  assigned  to  m  and  n  may  be  interchanged  without 
affecting  our  result. 

To  digress  for  a  moment,  we  may  also  apply  this  ball-in-the-um 
concept  to  the  event  A".  In  this  case  M  is  still  given  by 


M  = 


2  4a 

mr  k  lk  3 
P 


and  n  is  still  given  by  n  =  k^k  but  m  =1  since  we  are  interested 


i94 


in  only  a  single  set  of  m  bfi-blocks.  Furthermore,  we  wish  to  know 

rp  ° 

the  probability  of  exactly  one  success  (which  is  the  same  as  at  least 
one  success  in  this  case,  since  there  can  be  no  more  than  one). 

Hence,  Pr{A"}  will  be  given  by 


Pr  {A"}  =  Pr{E1} 


•<;> 


(in), 


(M)„ 

m(M-m)(M-m-l)°  •  •  (M-m-n+2 
=  n  M(M-1)‘  •  •  (M-n+1) 

(M-l)(M-2)»  •  (M-n+1) 


=  n 


n 

M 


M(M-l)*  •  •  (M-n+1) 


=  k0ok  ° 
2  4 


rar  k°k° 


13 


k<£k°k° 


4  a 


=  in 


vo  if  o 

K  i  3 


r  o 
p  k 

3. 

which  is,  of  course,  the  result  obtained  before. 

These  results  are  summarized  in  Table  4-4,  where  for  nota- 
tional  convenience  we  have  introduced  seme  new  symbols  xi  for 
ie{l,  2,  •  •  • ,  7}  to  represent  the  various  probabilities* 


x„  has  been  reserved  for  use  in  considering  uniqueness  of  type 
2,  for  which  there  must  be  defined  an  event  B"  analogous  to  event 


A". 
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Xj  =  Pr{A} 
x2  =  Pr{3} 
x3  =  Pr{A’} 
x4  =  Pr{B’} 

x5  =  Pr{A''} 
x?  =  Pr{c}  : 


k°2k°4 

k°r 

IfO  lrO 

k7  9 

“V  kr 
P 

k° 

a 


krio 


k° 

p 


=  m 


k°k° 

ka 


1  (M-m)(M-m-l)°  •  ♦  (M-m-n+1) 

•  •  (M-n+1) 


where  M  = 


m  = 


k°2k°4k° 

mrpklk°3 

k°7k°9 

m 


n  =k°2k°4 


Table  4-4,  Summary  of  Probabilities 
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Returning  now  to  consideration  of  t^ ,  we  see  that  the  prob¬ 
abilities  y^,  yg,  and  are  given  by 

yi  = 1_xi 
y2  =xx(l-x4) 

y3  =  xlx4 

Let  us  rewrite  the  expression  for  t-  - ,  incorporating  these  values 
for  yl»  y2’  311(1  y3* 


*11  =  Ta+  (1"X1J  z7  (Cr^ 

+  x1(l-x4){0B+^)^7(Cr1)+A6A8Z7(Cr1)+mr1t'>lz3Cp)^1Z4(C,J)]  } 

+  xlX4{z*7(c  )+(m  -1)[p1z3<C)+Piz4(C  )]  +[P1z*3(Cp)+o1z*4(CM]  } 


Now  let  us  consider  the  derivation  of  an  expression  for  t^g,  the 
nimber  of  time  units  required  to  perform  using  its  second  nethod. 
This  method  is  characterized  by  the  following  basic  steps: 


(1)  Access  all  occurrences  of  the  relation  symbol  r^  in  the 
storage  structure. 


(2) 


For  each  occurrence  of  the  relation  symbol  r^  exan  ine 
all  associated  sources  in  search  of  d^» 
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(3)  If  dj  is  f'U  *?,  examine  all  targets  in  the  corresponding 
target  set  in  search  ofp^. 


To  initiate  access  of  all  occurrences  of  the  relation  symbol  r., 

we  must  first  access  the  head  of  the  corresponding  relation  ring, 

which  hap  a  time  cost  of  T  time  units. 

r 

If  dj  is  not  associated  with  r.,  we  will  examine  all  sources 

associated  with  all  occurrences  of  r.  via  a K  ia  search  of  d.,  which 

]  o  1* 

we  will  not  find.  Since  the  probability  that  dj  is  not  associated  with 

r.  is  1-x, ,  the  effective  time  cost  for  this  situation  will  be  (l-x.)z_(C  ). 
I  1  '  1  5V  a' 


If  di  is  associated  with  r^  but  is  not  associated  with  this 
combination,  we  are  faced  with  two  possible  situations.  If  there  is  a 
single  copy  of  each  bg-block  associated  with  the  b^-block(s)  repre¬ 
senting  an  arbitrary  source  (i.e. ,  if  m  =  1),  the  bR-block  repre- 

ri  b 

serving  the  relation  symbol  r^  associated  with  the  source  d.  may  be 

determined  by  a*  if  on  the  other  hand  m  >  1,  all  b^-blocks  re- 
5  rj  }  6 

presenting  r^  and  all  their  assoc  iated  b^ -blocks  will  have  to  be 

considered  in  the  futile  search  for  p^.  In  other  words  a ^  must  be 

used  to  insure  that  no  copy  of  a  bg-block  representing  r^  has  been 

overlooked.  In  Loth  cases  (m  =1  and  m  >  1),  all  bn -blocks 

rl  rl  11 

associated  with  each  of  the  m  be-b' jcks  must  be  examined  via 

ri  6 

a ^  ‘i  the  search  for  p^.  Since  the  probability  that  d.  is  associated 
with  r,  butp^  is  not  associated  with  that  combination  is  x^(l-x^), 


198 


the  effective  time  cost  will  be 


*1<1-X4>[  <WzVCa>  +  a6A8z5(Ca>  + 

Finally,  if  dj  and  ^  ^re  associated  with  the  same  bg-block  which 

represents  r.,  we  may  use  ag  to  find  the  -block  and  bg-block 

representing  the  d./r.  combination  with  which  p.  is  associated, 
i  ]  k 

Since  there  are  m  bc-blocks  which  represent  r.  and  which  are 
rl  6  j 

associated  with  the  b^-block(s)  representing  dj,  we  would  expect 

a 

p,  to  be  associated  with  the  m  -th  b„-block  of  these.  Therefore, 

K  r.  o  7 

1  A 

for  m  -1  of  these  blocks  we  will  use  a.  and  for  the  m  -th  block 

ri  4  ri 

we  will  use  a*..  Our  effective  time  cost  for  this  situation  will  then  be 
4 

xlx4t  z5  <Ca>  +  <“ri"1)Z4(S)+  z*4  <Cp>] 

Combining  all  the  terms  obtained  above  yields  the  following 
expression  for  t^: 

+x1(l-x4)[a6+Vz*5(Ca)+a6a8z5(Ca)+mr1z4<Cp)1 

«1*4[  zVCa)  +  (mr1"1)z4(S>)  +  Z 4(S ^ 

Next  we  will  consider  t^,  the  number  of  time  units  required  to 
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perform  using  its  third  method.  Because  of  the  symmetry  of  the 
SSM  and  the  symmetry  of  methods  2  and  3  for  Qp  we  would  expect 
a  very  strong  similarlity  between  the  expressions  for  t^  and  tp. 


In  fact  if  mT)  =  1,  these  expressions  should  be  exact  analogs. 

rp 

us  then  assume  for  the  moment  that  m  is_  1.  The  resulting 

rp 


expression  for 


will  be 


Let 


*13  =Tr  +  (1-X2)l56(Cp) 

«2U-x5)[  <Vfyz  V«(S>  )+n,r,z2(Ca>l 

£t 

z6<Cp  -D*2<Ca)+z*2(Ca> ) 

Let  us  now  consider  what  effec*  m  ^  1  will  have  upon  this 

rp 

expression.  Quite  simply  if  m  >1,  there  will  be  more  than  one 

rp 

bg-block  which  represents  the  same  realtion  symbol  associated  with 

the  bp -block  representing  a  given  target.  This  means  that  whenp^ 

is  associated  with  r.  but  d.  is  not  associated  with  this  combination, 

we  must  consider  ali  bA-blocks  representing  r.  and  all  their  associated 

«  j 

b, ,  -blocks  in  the  futile  search  for  d.  in  order  to  insure  that  no  b_- 
11  1  o 

olock  representing  r.  and  associated  with  p,  is  overlooked.  This  is 

true  regardless  of  whether  or  not  m  =1.  Secondly,  instead  of  m 

r2  r2 
bA-blocks  to  consider,  we  will  have  m  m  bA-blocks  representing 

b  rp  r2  0 
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r\  to  consider. 

Let  6p  be  a  binary -valued  variable,  the  value  of  which  indicates 

whether  (5  =  1)  or  not  (5=0)  m  =1.  Also,  let  m  =  m  m  . 

p  P  rp  ZP  rp  r2 

Then  we  can  rewrite  the  expression  for  tjg  to  reflect  the  effects  of 

m  as  follows: 
rP 

*13  =  V  (1-X2)Z6(V 

+x2(l-x5)[  6  (53+a5)z*6(^W(6;+A  A  )z  (C  )+m  z  (C  )] 

P 

z*6(Cp'+(“z  -1)z2(Ca)+z*2(Ca)] 

P 

Finally,  let  us  consider  t^,  the  number  of  time  units  required  to 

perform  using  its  fourth  method.  Again,  because  of  the  symmetry 

of  the  SSM  and  the  symmetry  of  methods  1  and  4  for  Q^,  we  would 

expect  a  strong  similarity  between  the  expressions  for  t^  and  t^. 

The  comments  made  about  mr  in  our  discussion  of  t^  also  apply 

P 

here.  Thus,  we  can  easily  derive  the  following  expression  for  t^: 
t14=V(l-X2>z8<Cr2> 

(A3+A5)z*8(c  ),(F  kA3A5)^C  A^Ca^^a'l  > 

u  z  p 

tX2x5{z*8(Cr,)+<“z'1)[^Zl(Ca>+?3Z2(Ca)l  +[  V l(Ca)+%z*2(Ca>l > 

2  p 
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As  a  matter  of  academic  interest  we  should  like  to  be  assured 
that  the  probability  that  operation  Qj  is  successful  will  be  the  same 
regardless  of  the  method  of  implementation.  Clearly,  we  are 
guaranteed  that  this  is  so  if  it  is  true  that  x^x^  equals  x^x,..  Let  us 
check  then  whether  the  following  equality  is  true: 

xlx4  'Vs 


k>4 

irOifO 

8  10 

k«k» 

klk°3 

m 

ko 

r 

k° 

P 

mrk°r 

AAl 

rP  k° 

*  a 

k°2k4k8k<ioka  '  W, 

which  we  know  to  be  true. 

We  can  continue  in  the  manner  above  to  develop  expressions  for 
the  methods  for  each  of  the  remaining  primitive  operations.  Rather 
than  belabor  these  derivations  to  the  point  of  monotony,  we  have 
chosen  simply  to  summarize  the  expressions  for  the  various  t^  in 
Appendix  D. 
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4.3  Storage  Cost  Function 


To  complete  our  discussion  of  the  measures  of  performance, 
let  us  develop  an  expression  for  the  number  of  storage  units  required 
by  a  given  storage  structure. 

If  we  can  determine  the  number  of  storage  units  required  for 
each  type  of  block  in  a  given  structure,  then  we  may  sum  over  all 
block  types  the  product  of  the  number  of  storage  units  required  for 
each  type  of  block  and  the  number  of  blocks  of  that  type  in  the 
structure  to  determine  the  basic  storage  cost  of  the  structure.  For 
instance,  if  represents  the  number  of  storage  units  for  each  bj- 
block,  then  the  basic  storage  cost  will  be  given  by 


/.  m.  u. 
1=1  1  1 


To  determine  the  entire  storage  cost  of  a  structure  we  musi 
add  to  this  expression  the  number  of  storage  units  required  by  the 
various  description  blocks  and  the  source,  relation,  and  target  ring 
heads.  Let  u^  represent  the  (average)  number  of  storage  units 
required  by  a  description  block  and  let  ua,  ur,  and  u^  represent  the 
corresponding  quantities  !or  the  source,  relation,  and  target  ring 
heads,  respectively.  The  quantity  to  be  added  to  the  basic  storage 
cost  will  then  be  given  by 
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n  u ,  +  k°  u  +  k°  u  +  k°  u 
d  a  a  r  r  p  p 


Thus,  we  may  define  the  storage  cost  S  for  a  given  storage 
structure  as 
11 


8  =  7  m.  u.  4  n  u ,  +  k°  u  +  k°  u  +  k°  u 
•  l  l  d  aa  rr  pp 


Let  us  consider  now  the  determination  of  m  for  all  appropriate 
values  of  i.  In  general,  a  t^-block  consists  of  a  number  of  fields 
containing  pointers  for  the  various  rings  which  may  pass  through  it, 
possibly  a  type  field,  and  (if  ie{5,6,  7})  possibly  a  relation  symbol 
name  field.  If  we  know  the  fields  of  which  a  b. -block  is  composed, 
we  can  sum  the  numbers  of  storage  units  required  by  these  fields 
to  determine  u.. 

Let  us  assume  that  a  field  containing  a  forward  pointer  for  an 

a.-ring  requires  et  units  of  storage  and  that  afield  containing  a  head 

pointer  for  an  a^-ring  requires  s^  units  of  storage,  where  ig  { 1 , 2 ,  ••«,  ID}. 

Further,  let  us  assume  that  a  description  block  indicator  requires  s^ 

units  of  storage  and  that  a  pointer  in  a  source,  relation,  or  target 

ring  requires  s  units.  Finally,  assume  that  a  relation  symbol  name 
P 

field  requires  sr  units  of  storage  and  a  block  type  field  requires  s^ 
units.  (If  the  type  code  does  not  require  a  separate  field,  we  will 
assume  that  s^  =0.) 
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As  an  example  let  us  consider  the  derivation  of  an  expression 
for  Up  the  number  of  storage  units  required  by  a  -block.  A  block 
of  this  type  contains  a  pointer  field  for  the  appropriate  source  ring; 
if  0^  =1,  it  contains  a  pointer  field  for  an  a^-ring;  if  0^  =1,  it  con- 
ains  a  pointer  field  for  a  head  pointer;  if  Oj  =  1 ,  it  contains  a  pointer 
field  for  a  description  block  indicator;  and  finally,  if  =  1,  it  con¬ 
tains  a  type  field.  As  a  result,  u^  may  be  characterized  by  the 
following  expression: 

ui  =  V  0i%+0ish1+,7isd  +  Tist 

In  a  similar  mannei  we  may  obtain  expressions  for  all  other 
block  types. 


U11  =  SP  +  *L C  \  **10  V  *  a  2  Sd  +  T11  St 


L10 


10 


ui  =  sf.  f  ^  sf.  +  Ti  st>  for  i€  {2>4>M°} 

ui  =? i .<  0i-l  sfi_1  +  0U  \_x  +  0i  %  +  0i  sh.  +  Ti  £t} 

for  ie  {3,9} 


u5  =  *5  (04  Sf4  "  04  sh4  +  05  Sf5  +  05  Sh5  +  P1  sr  +  T5  V 

U6  =  ^6  *  05  Sf .  *  06  Sf„  +  sp  p2  Sr  +  T6  St' 

D  b 

u7  =  V <  06  sf  .  +  06  \  +  07  sf ,  +  07  Sh7  +  p3  Sr  +  T7  st  > 

o  b  (  ( 
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We  note  that  as  a  practical  matter  it  is  often  the  case  that  , 

1 

s,  ,  s,,  and  s  are  all  equal  (for  all  ie  {l, 2, •  •  •  10}}. 

li.  vi  1 f 
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Chapter  V 


A  PROCEDURE  FOR  THE  DETERMINATION  OF  A 
MINIMUM  COST  STORAGE  STRUCTURE 

In  ine  previous  chapter  we  developed  two  measures  of  perfor  - 
trance  -  a  time  cost  function  T  and  a  storage  cost  function  S  -  for  our 
Storage  Structure  Model.  Our  goal  in  this  chapter  will  be  to  develop 
a  procedure  which  utilizes  these  measures  to  determine  a  minimum 
cost  storage  structure  for  a  given  data  structure.  (Recall  that  we 
consider  a  storage  structure  to  have  minimum  cost  if  it  minimizes 
T  subject  to  a  given  constraint  on  S. ) 
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5. 1  Reducing  the  Number  of  Feasible  Solutions 


T  and  S  are  functions  of  a  large  number  of  variables.  In  order 
to  facilitate  referring  to  these  variables  we  will  partition  them  into 
two  classes:  parametric  variables  (or  simply,  parameters)  and 
decision  variables.  The  parametric  variables  are  those  which 
characterize  the  environment  in  which  our  storage  structure  is  to 
exist  (e.g. ,  s.,  f^  k9,  and  s^  )  and  are  those  over  which  we  assume 
no  control  can  be  exercised  to  minimize  T  and  S.  The  decision 
variables,  on  the  other  hand,  characterize  the  form  of  the  storage 
structure  itself.  In  particular,  the  class  of  decision  variables 

consists  of  01»“‘01O,  ^••c0Jo>Aie-“Aio^2”°/3lO»  °V  a2fpVp2> 
and  py  Thase  are  the  variables  over  which  the  cost  minimization 

is  to  be  performed. 

We  know  from  our  discussions  of  Chapter  III  that  the  values  of 
the  various  decision  variables  may  not  all  be  specified  independently. 
For  instance,  the  values  of  (3^  •  -■  (3^^  are  determined  uniquely  once 
values  are  assigned  to  Aj° 0  311(1  p3‘  Let  us  assume  for 

the  moment,  however,  that  the  decision  variables  • »  0^ 

0j“°0jO>  Ai“°*Aioj  aVu2,pl’p2,  311(1  P 3  are  all  independent. 

Since  each  of  these  variables  is  binary -valued,  this  implies  that  it 
is  possible  to  specify  2^  or  roughly  3x10*^  different  storage 
structures  via  these  variables. 
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Clearly,  some  caution  must  be  exercised  in  choosing  a  method 
for  determining  that  storage  structure  which  satisfies  our  conditions 
of  optimality  lest  solution  of  our  problem  become  computationally 
infeasible. 

Unfortunately,  the  situation  is  further  complicated  by  the  fact 
that  the  time  cost  function  T  is  rather  ill-behaved.  First,  T  cannot 
feasibly  be  written  as  an  explicit  function  of  the  decision  variables. 
Second,  T  is  a  monotone  function  of  none  of  the  decision  variables 
except  PpP2>  311(1  P3«  As  a  result,  T  does  not  lend  itself  to  minimi¬ 
zation  by  any  of  the  common  optimization  techniques  for  functions  of 
zero-one  variables. 

In  order  to  reduce  the  number  of  ’feasible  solutions”  which  must 
be  considered  in  determining  the  optimal  storage  structure,  let  us 
make  a  number  of  observations  and  decisions  based  upon  these 
observations. 

Consider  for  a  moment  an  a. -ring  and  a.+^-ring  pair  where 
ie  {2, 4,6, 8}.  In  general,  this  ring  pair  may  assume  any  of  the  forms 
given  in  the  following  table: 
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2* 

I 


airing 

ai+rring 

\  Vi 

^i_ 

*w 

ring 

ring 

1  1 

0 

0 

ring 

stack 

1  0 

0 

0 

stack 

ring 

0  1 

0 

0 

ring 

duplicated 

1  0 

0 

1 

duplicated 

ring 

0  1 

1 

0 

stack 

duplicated 

0  0 

0 

1 

duplicated 

stack 

0  0 

1 

0 

duplicated 

duplicated 

0  0 

1 

1 

We  note  that  the  time  cost  required  to  sequence  through  a  stack 
is  less  than  or  equal  to  that  required  to  sequence  through  a  ring 
of  the  same  composition.  We  also  note  that  a  ring  and  a  stack  have 
the  same  basic  structure  with  regard  to  the  ordering  and  acces- ability 
of  their  elements.  Furthermore,  a  stack  requires  less  storage  than 
a  ring  of  the  same  composition. 

Since  we  are  not  concerned  with  manipulative  operations  (for 
which  a  ring  can  be  preferable  to  a  stack),  it  follows  that  the  structures 
"ring-stacl.  and  "stack-ring"  for  the  a^-ring  and  a^-ring  pair  will 
always  require  less  time  for  equivalent  operaiions  and  less  storage 
than  the  structure  "ring-ring". 

Since  the  remainder  of  the  SSM  is  the  same  regardless  of  which 
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of  the  three  structures  "ring-ring",  "ring-s  tck",  or  *fetack-ring"  is 
implemented  for  the  given  ring  pair,  it  is  always  advantageous  to 
choose  one  of  the  latter  two.  We  can  therefore  exclude  0rt+i=i  for 
ie{2,4,6,C}  from  further  consideration. 

Suppose  that  0^  =  0  and  0.  Then  each  a^-ring  and  a^-ring 
pair  functions  just  as  the  a^-ring  and  a.  ring  pair  just  discussed 
(assuming  that  A  and  II  are  not  disjoint).  It  follows  that  if  a^  =  0  and 
a 2  =  0,  we  can  exclude  0^=  0^  =  1  from  consideration.  (Recall  that 
when  0^  =  0  and  a^=  0  at  least  one  of  0^  and  01Q  must  be  1,  however.) 

If  on  the  other  hand  cr^  =  1  or  1  or  both,  then  the  aj -rings  and 
the  a^- rings  will  be  independent  of  one  another  (as  well  as  of  the 
rest  of  the  SSM).  Applying  the  stack -versus -ring  arguments  to  each 
in  turn  leads  us  to  conclude  that  it  is  always  advantageous  to  make  the 
a^ -rings  and  the  a^-rings  stacks  instead  of  rings.  Thus,  when  a^=  1 
or  a2=  1  or  both,  we  should  always  set  0^  =  0  and  0^Q=  0. 

These  observations  have  allowed  us  to  reduce  the  number  of 
comoinations  of  values  for  0j'  •  °  0jq  from  1024  to  at  most  81  when 
o^=  1  or  o2=  1  or  both  and  to  at  most  162  when  0  and  o2=  0. 

Let  us  now  examine  the  role  of  the  head  pointer  in  an  arbitrary 
a^-ring.  If  for  a  particular  operation  the  direction  of  access  is  away 
fro..1  the  head  of  the  ring,  the  head  pointer  can  make  no  contribution 
We  can  say  in  general  then  that  if  there  are  no  operations  to  be 
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performed  upon  the  structure  such  that  the  direction  of  access  is 
toward  the  head  of  the  a. -ring,  there  is  no  benefit  to  be  obtained 
from  a  head  pointer  in  that  ring  and  we  car  set  0!  n.  This  will, 
of  course,  aTso  tend  to  minimize  the  storage  requirement  of  the 
structure. 

Suppose  on  the  other  hand  that  there  is  at  least  one  operation 
to  be  performed  upon  the  structure  such  that  the  direction  of  access 
is  toward  the  head  of  the  a^-ring.  Since  we  expect  to  enter  the  ring 

A 

at  the  k.-th  element, either  we  can  use  the  head  pointer  to  go  directly 
from  this  element  to  the  head  or  we  can  step  through  the  intervening 

A 

k^-1  elements  to  reach  the  head,  where  the  time  required  to  step 
from  one  element  to  the  next  is  either  si  or  fi  depending  upon  whether 
the  airing  is  a  stack  or  an  explicit  ring,  respectively.  It  follows 
that  in  order  to  accrue  any  benefit  from  a  head  pointer  the  time  cost 
required  to  follow  it  must  be  less  than  or  equal  to  that  required  to 
step  through  the  intervening  elements. 

If  the  a^-ring  is  an  explicit  ring  (0.-  1),  this  means  that 

A 

h.  <  k.f. 
i  -  ii 

must  be  true  in  order  to  gain  advantage  from  0!  =  1„  (Recall  our 
earlier  assumption  that  <  L  <_h.  for  ie  {1 , 2,  •  •  •  10}„ ) 

If  the  a^-ring  is  a  stack  (0^=  0  and  a^=  0),  however,  the  situation 


becomes  slightly  more  complex.  Setting  0!=  1  when  ^=0  may  mean 

that  the  time  required  to  follow  the  head  pointer  is  h.+  so  instead  of 

h.„  (See  the  expressions  for  the  various  z9. )  Thus,  if  0!=  1  results 

in  the  inclusions  of  an  s  term,  the  condition 

o  * 


h.  +  s„  <  k.  s. 

1  O—ll 


must  be  satisfied  for  0!=  1  to  result  in  a  time  reduction.  On  the 
other  hand  if  no  such  inclusion  results,  the  condition 


A 

h.  <  k.  s. 

l—ii 


must  be  met. 

The  result  of  this  is  that  the  values  of  0^*  •  •  0jQ  may  be  uniquely 
determined  from  the  values  of  0^ » » •  0^  and  A^°  •  °  (given  actual 
values  for  sQ  and  s^  L,  and  h .  for  ie{l,2, •  •  •  10}),  which  reduces 
the  number  of  solutions  which  must  be  considered  by  a  factor  of 

3 

roughly  10  . 

To  further  reduce  the  number  of  solutions  which  must  be  con¬ 
sidered,  let  us  treat  the  variables  a1,a2»Pi»/52»  311(1  P3  35  external 
decision  variables.  That  is,  let  us  assume  that  we  assign  values 
(externally  to  the  solution  process)  to  these  variables  and  then  solve 
for  the  optimal  storage  structure  subject  to  the  constraints  of  these 
values. 
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As  a  practical  matter  this  assumption  does  not  appear  to  be  very 
restrictive  since  the  overall  optimal  solution  will  probably  result  in 
the  values 


P1  p2  p3 


most  of  the  time  anyway.  In  any  event  all  possible  combinations  can 
be  considered  if  so  desired  by  specifying  them  individually. 

We  are  now  faced  with  the  problem  of  determining  values  for 
only  and  Aj*  •  •  A^q  such  that  our  optimality  conditions  are 

satisfied.  It  so  happens  that  the  constraints  upon  these  variables 
(Chapter  HI)  further  reduce  the  number  of  solutions  to  be  considered 
by  a  factor  of  10. 

In  general  terms  we  have  reduced  the  number  of  solutions  which 


must  be  considered  by  a  factor  of  10  via  our  decisions  concerning 

3 

01  •  •  ’  0jq,  a  factor  of  10  by  our  decisions  involving  0'^ « •  0'^  a 


factor  of  30  for  OpCr^ppPg,  and  pg, 


and  a  factor  of  10  from  the 


constraints  on  0^*  •  •  0^  and  Aj»  •  •  A^q.  As  a  result  we  are  faced 

4 

now  with  considering  only  10  possible  solutions.  (In  fact,  when 


(7j  and  (jg  are  not  both  0,  we  need  consider  <it  most  8960  solutions.) 


Although  we  might  wish  to  reduce  this  number  further  (which  may 
in  fact  be  impossible  to  accomplish  through  reasonable  effort),  it 
falls  well  within  the  computational  limits  of  the  computer  and,  hence, 
^.ould  not  concern  us  unduly  at  this  point. 

We  will,  however,  indicate  a  special  case  which  allows  us  to 
reduce  the  number  of  solutions  which  must  be  considered  by  a  factor 
of  approximately  2.  Because  of  the  symmetry  of  the  SSM,  if  we 
assign  values  to  the  various  parameters  and  external  decision  vari¬ 
ables  in  a  symmetric  manner  (e.g. ,  =k^Q,  k°  =k®  ,  a 2, 

etc.),  each  solution  will  have  a  "mirror  image"  which  has  the  same 
time  and  storage  costs.  For  instance,  if  0^«  •  •  0^  =  00010Q0010  and 
Al°  *  A10  =  0110111001  *s  one  solution  and  if  the  various  parameters 
and  external  decision  variables  have  symmetric  values,  then  the 
mirror  image  solution  0^*  •  •  0^  =  0100001000  and  •  •  A^q  = 
1001110110  will  have  the  same  time  and  storage  costs  as  the  first 
solution. 

This  means  that  once  we  have  considered  a  given  solution  we 
need  not  consider  its  mirror  image.  Note  that  certain  solutions  are 
their  own  mirror  images.  Hence,  we  must  consider  something  over 
half  of  all  the  possible  solutions. 
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5. 2  Optimization  Procedure 


Let  us  now  consider  the  basic  procedure  which  we  Mil  use  to 
determine  the  solution  which  satisfies  our  optimality  conditions. 

First  we  assign  values  to  the  various  parameters  and  external 
decision  variables.  We  then  generate  a  sequence  of  all  the  value 


combinations  for  0^«  •  •  For  each  of  these  value  combinations 
we  generate  a  sequence  of  all  those  value  combinations  for  A^*  •  •  Ajq 
which  satisfy  the  constraints  of  Chapter  III.  For  each  resultant 
assignment  of  values  to  0^*  •  •  0^  and  A^«  •  •  A^q  we  may  determine 
the  corresponding  values  of  0!  ■>  •  •  0’^  and  A  given  com¬ 

bination  of  values  for  0^. . .  0^,  0l°°°0io’  AI°“A10’  and  P2°  *  *  010 
describes  a  possible  state  of  the  SSM. 

Our  goal  is  to  deter  nine  that  state  of  the  SSM  which  minimizes 
the  time  cost  function  T  subject  to  a  possible  limit  on  the  value  of  the 
storage  cost  function  S. 

Let  represent  the  i-th  state  in  the  sequence  of  states,  where 
ie{l,  2,  •  •  •  ,M}  and  M  is  the  number  of  states.  Assume  that  the  state 


77.  j  is  considered  before  the  state  ^  which  in  turn  is  considered 
before  the  state  77^  and  so  forth. 

Let  T.  and  represent  the  values  of  T  and  S,  respectively,  for 
the  state  77.. 

Let  SQ  represent  the  upper  bound  to  be  placed  upon  the  storage 
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cost  function  S. 


I 

I 


We  may  then  define  the  following  procedure  to  determine  the 
optimal  state  tj*  of  the  SSM. 

(1)  Set  T*  =  oo  and  i  =1. 

(2)  If  SQ  =  oo,  go  to  step  (3). 

Otherwise  if  Si  >  SQ,  go  to  step  (4). 

(3)  If  T.>  T*,  go  to  step  (4). 

K  T.  <  T*,  sot  T*  =  T.,  S*  = 

i)*  =  r^,  and  go  to  (4) 

If  T.  -  T*  and  S.  <  S*,  set  S*  =  S.,  r f  =  t^, 

and  go  to  (4). 

(4)  Set  i  =  i+ 1 . 

If  i  <  M,  go  to  step  (2). 

Otherwise,  stop,  rj*  is  the  optimal  state  of  the  SSM. 

To  illustrate  the  feasibility  of  this  approach  the  procedure  has 
been  implemented  via  a  prototype  program  on  the  Michigan  Terminal 
System  (MTS)  for  the  IBM  360/67  at  The  University  of  Michigan.  In 
the  next  chapter  we  will  consider  some  results  obtained  through  the 
use  of  this  program. 
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Chapter  VI 


APPLICATION 

Tn  the  preceding  chapters  we  have  developed  a  model  for  data 
structure,  a  model  for  the  storage  structures  which  can  represent 
any  data  structure  specified  by  the  data  structure  model,  and 
finally  a  technique  for  determining  a  minimum  cost  storage  structure 
to  represent  a  particular  data  structure  in  the  solution  of  a  given 
problem. 

In  this  chapter  we  will  apply  these  tools  to  a  specific  problem  to 
determine  a  minimum  cost  storage  structure  for  the  data  associated 
with  the  solution  of  that  problem,  we  will  compare  the  results  we 
obtain  to  the  storage  structure  used  in  an  existing  system  for  solving 
that  problem,  and  we  will  examine  the  sensitivity  of  our  results  to 
variations  in  the  parameters  used  to  describe  the  problem  and  the 
structure  of  its  data. 
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6. 1  The  Problem:  A  System  for  Medical  Diagnosis 


As  the  example  to  illustrate  the  use  of  the  techniques  we  have 
developed,  let  us  consider  the  representation  of  data  associated 
with  a  computer  system  for  aiding  medical  diagnosis. 

In  the  next  few  sections  which  follow  we  will  concern  ourselves 
with  a  general  description  c,i  a  diagnostic  system,  the  specification 
of  the  data  and  the  data  structure  for  a  particular  diagnostic  system, 
and  the  determination  of  a  minimum  cost  storage  structure  for  that 
system. 

6.1.1  General  Description 

In  the  general  sense,  the  diagnostic  problem  is  to  ascertain  the 
current  state  of  some  given  system.  The  diagnostician  uses  infor¬ 
mation  acquired  from  past  experience  with  such  systems,  coupled 
with  specific  observations  or  tests  of  the  given  system,  in  order  to 
deduce  the  identity  of  its  current  state. 

In  particular,  the  medical  diagnostician  is  concerned  with  deter¬ 
mining  the  "state”  of  his  patient.  The  physician  has  learned  through 
training  and  experience  the  sign  and  symptom  patterns  associated 
with  possible  diseases  from  which  the  patient  can  suffer. 

In  theory,  a  given  set  of  signs  and  symptoms  (hereafter  referred 
to  simply  as  symptoms)  should  characterize  each  disease  uniquely. 
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In  practice,  however,  different  diseases  may  result  in  similar 
symptoms.  Observation  of  many  different  symptoms  may  be  re¬ 
quired  to  identify  a  particular  disease,  and  a  given  symptom  may 
suggest  many  possible  diseases.  This  fact  coupled  with  the  lar  ;e 
number  of  symptoms  and  diseases,  requires  the  diagnostician  to 
master  considerable  amounts  of  information. 

The  diagnostic  process  is  further  complicated  by  a  number  of 
other  factors.  First,  the  relationships  between  symptoms  and 
diseases  are  often  known  only  in  probabilistic  terms.  Second,  the 
tests  required  to  determine  whether  indeed  certain  symptoms  exist 
may  exact  a  high  cost  (in  terms  of  risk  to  the  patient,  patient  discom¬ 
fort,  money,  etc.).  This  cost  must  be  weighed  against  the  potential 
usefulness  of  the  test  results  Finally,  because  of  the  vast  amounts 
of  information  required  in  the  diagnostic  process,  those  symptom 
patterns  and  testing  strategies  associated  with  seldom  encountered 
diseases  may  be  effectively  lost  to  (i.e. ,  forgotten  by)  the  diagnosti¬ 
cian. 

By  using  a  computer  program  to  provide  general  diagnostic 
assistance  to  its  user,  a  number  of  these  difficulties  can  be  overcome, 
or  at  least  minimized.  One  of  the  principal  advantages  to  be  gained 
from  the  use  of  a  computer  is  the  sheer  bulk  of  information  which  it 
can  maintain.  Because  a  program  can  consider  more  possible 
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diagnoses  than  a  human  being,  it  can  provide  a  strong  safeguard  that 
a  particular  disease  is  not  overlooked  in  the  diagnosis. 

For  these  reasons  there  has  been  increasing  interest  in 
computer-  aided  diagnosis.  In  particular,  a  number  of  programs 
have  been  written  which  are  capable  of  performing  diagnosis 
in  particular  medical  areas.  As  a  rule,  these  programs 

employ  a  Bayesian  analysis  of  symptoms  based  on  a  disease-symptom 
probability  matrix  for  the  given  set  of  diseases  considered.  That  is, 
the  programs  compute  the  probability  of  disease  D  given  the  symptom 
profile  S  (i.  e. ,  ihe  set  of  symptoms)  as  follows: 

?(DlS)  =  P<D>P(S-|D> 

Yj  p(d)p(s  Id) 

D 

where  P(D)  is  the  a  priori  probability  of  the  occurrence  of  disease 
D  and  P(S  ID)  is  the  conditional  probability  that  S  occurs  given  D. 

Since  it  is  not  ouv  purpose  to  design  a  complete  diagnostic  system, 
we  refer  the  reader  to  the  literature  for  more  complete  descriptions 
of  the  mathematical  techniques  involved. 

6.1.2  A  Particular  Diagnostic  System 

Gorry  [  25  ]  has  designed  and  implemented  a  general  purpose 
diagnostic  system  (although  the  examples  he  has  considered  are 
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exclusively  in  the  area  of  medical  diagnosis)  which  is  representative 
of  these  systems  and  is  the  one  which  we  shall  use  as  a  reference. 

Let  us  examine  the  basic  assumptions  and  characteristics  of 
this  system.  The  objective  of  the  diagnostic  process  for  the  medical 
problem  is  to  determine  the  malady  or  disease  with  which  an  individual 
is  afflicted.  This  disease  is  assumed  to  be  one  of  a  finite,  but  perhaps 
quite  large,  number  of  possible  diseases.  Information  concerning  the 
given  disease  can  be  obtained  by  performing  a  variety  o*  tests  upon 
the  patient.  These  tests  may  range  from  simple  questions  as  in 
history -taking  to  complicated  medical  procedures  such  as  exploratory 
surgery.  The  results  of  tests  applied  to  the  patient  are  then  combined 
by  the  diagnostician  with  his  experience  with  other  diagnostic  problems 
to  deduce  the  particular  disease.  In  the  case  of  the  diagnostic  program 
this  experience  is  reflected  in  probability  distributions  which  charac¬ 
terize  the  results  of  certain  tests  given  a  particular  disease.  Speci¬ 
fically,  this  information  relates  symptoms  (the  results  of  the  tests) 
to  particular  diseases.  A  symptom  is  assumed  to  be  binary-valued 
(i.e. ,  either  present  or  absent)  and  a  teat  is  used  to  determine  the 
presence  or  absence  of  some  number  (perhaps  greater  than  one)  of 
symptoms. 

Associated  with  each  test  is  a  cost  of  applying  it  to  the  patient 
(in  terms  of  risk  or  discomfort  to  the  patient,  the  services  of  skilled 
personnel,  money,  etc.)  and  therefore  it  is  advantageous  to  make  a 
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decision  about  the  disease  based  upon  a  limited  number  of  tests.  On 
the  other  hand  associated  with  each  disease  is  a  cost  reflecting  the 
loss  resulting  from  diagnosing  that  disease  as  another.  (Diagnosing 
a  malignant  tumor  as  benign  is  very  costly  ’  should  clearly  be 
avoided . )  The  possibility  of  loss  for  an  incorrect  diagnosis  tends 
to  value  extensive  testing  prior  to  making  a  decision.  One  must 
therefor  a  balance  the  testing  cost  against  the  cost  for  an  incorrect 
decision  in  an  effort  to  minimize  the  overall  expected  cost. 

To  accomplish  the  goals  above,  Gorry's diagnostic  system  performs 
three  logical  functions: 

(1)  The  interpretation  of  the  symptoms  for  a  particular 
problem  via  Bayes'  rule,  given  conditional  probabilities 
that  particular  symptoms  are  associated  with  certain 
diseases  and  given  a  priori  probabilities  that  these 
diseases  will  occur  in  a  given  population.  This  is 
called  the  inference  function. 

(2)  The  selection  of  tests  to  be  applied  to  the  patient  under 
consideration  in  order  to  obtain  further  symptoms.  This 
is  called  the  test  selection  function. 

(3)  The  analysis  of  the  symptoms  observed  to  determine 
whether  there  are  any  irrelevant  symptoms  present 

(such  as  a  sore  elbow  when  all  other  symptoms  indicate 
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typhoid  fever)  or,  in  the  general  case,  to  detect  symp¬ 
tom  patterns  for  more  than  one  disease  occuring  simul¬ 
taneously.  This  is  called  the  pattern-sorting  function. 

The  use  of  the  inference  function  should  oe  quite  clear,  but  the 
test  selection  function  and  the  pattern- sorting  function  may  require 
brief  descriptions. 

Consider  first  the  test  selection  function.  The  purpose  of  this 
function  is  to  select  tests  in  such  a  way  as  to  minimize  the  overall 
expected  loss  of  the  diagnosis.  For  finite  numbers  of  symptjms  and 
diseases  and  a  finite  number  of  potentially  useful  test  sequences  the 
optimal  choice  of  tests  can  be  obtained  by  constructing  a  decision 
tree  and  folding  back  this  tree  in  terms  of  expected  loss.  Such  a  tree  - 
a  portion  of  which  appears  in  Figure  6-1  -  consists  of  two  types  of 
nodes:  decision  nodes  and  what  Gorry  calls  "nature’s  nodes". 

A  decision  node  represents  the  current  status  of  the  diagnostic 
problem  as  given  by  the  probability  distribution  over  the  states  (i.e.  , 
diseases)  of  the  system.  Emanating  from  each  decision  node  are  a 
number  of  branches,  one  for  each  possible  test  which  can  be  performed 
and  one  corresponding  to  a  terminal  decision.  Each  branch  corre¬ 
sponding  to  one  of  the  tests  which  can  be  performed  leads  to  one  of 
nature's  nodes. 

Associaied  wit!  each  of  nature’s  nodes  are  a  number  of  branches 
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Decision 

Node 


representing  the  possible  outcomes  (i.e. ,  symptoms)  which  can  result 
from  performing  the  test  corresponding  to  the  branch  which  leads  to 
this  node.  Each  of  these  ’’outcome  branches"  leads  in  turn  to  another 
decision  node. 

Thus,  if  somewhere  in  the  course  of  performing  a  diagnosis  we 
encounter  decision  node  d^  of  Figure  6-1,  we  may  choose  to  perform 
any  one  of  the  tests  T^4  •  •  TR  in  order  to  gain  further  information 
about  the  state  of  the  system  under  consideration  (i.e. ,  about  the 
disease  afflicting  our  patient),  or  we  may  choose  to  make  a  final 
diagnosis.  If  we  decide  to  make  a  final  diagnosis,  wt  follow  the 
branch  of  the  decision  tree  which  leads  from  decision  node  d^  to  the 
terminal  decision  D  and  the  diagnostic  process  is  concluded.  Suppose 
on  the  other  hand  we  decide  to  perform  test  T^.  In  this  care  we 
follow  the  branch  from  decision  node  to  nature’s  node  n^.  Depend¬ 
ing  upon  whether  performing  the  test  results  in  symptom  s^  or 
Sg,  we  will  follow  the  branch  from  nature's  node  n^  to  decision  node 
dg  or  decision  node  dg,  respectively.  This  newly  accessed  decision 
node,  with  the  probability  distribution  of  the  states  <  the  system 
updated  to  reflect  the  information  conveyed  by  the  symptom  leading 
to  the  node,  now  represents  the  current  states  of  the  diagnosis.  At 
this  node  we  again  have  a  choice  between  making  a  final  diagnosis  and 
performing  one  of  several  tests  to  obtain  additional  information. 


226 


Since  the  number  of  decision  nodes  of  the  decision  tree  grows 
exponentially  with  the  number  of  symptoms  and  tests  and  since  we 
may  commonly  expect  to  encounter  pr  jblems  involving  large  numbers 
of  symptoms  and  tests,  it  is  generally  computationally  infeasible  to 
search  the  entire  decision  tree  (or  even  a  large  portion  of  it)  in  order 
to  determine  the  optimal  sequence  of  tests  to  be  performed  to  minimize 
the  overall  cost  of  the  diagnosis  .  Therefore,  Gorry  has  developed 
an  heuristic  which  allows  him  to  search  a  relatively  small  portion  of 
the  decision  tree  below  each  decision  node  encountered  in  the  diag¬ 
nostic  process  in  order  to  determine  whi<h  test  should  be  performed 
at  that  node.  While  this  heuristic  results  in  a  sub-optimal  sequence 
of  tests,  the  number  of  decision  nodes  which  must  be  considered  is 
greatly  reduced,  which  more  than  onsets  this  disadvantage. 

Starting  with  thi  root  of  the  decision  tree  (i.e. ,  the  decision 
node  which  represents  the  initial  status  of  the  diagnostic  process),  the 
test  selection  function  applies  Gorry 's  heuristic  to  a  portion  of  the 
decision  tree  below  this  node  and  determines  the  best  test  to  perform 
at  this  stage  of  the  diagnosis.  The  result  obtained  by  performing  this 
test  leads  to  another  decision  node  of  the  tree,  at  which  point  a  termi¬ 
nal  decision  may  be  made  or  the  test  selection  function  may  again  be 
invoked.  This  process  continues  until  a  terminal  decision  (i.e. ,  a 
final  diagnosis)  is  reached. 
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Now  consider  the  pattern  -sorting  function.  As  we  shall  see  later 
only  those  symptoms  significant  to  the  diagnosis  of  a  particular  disease 
are  associated  with  that  disease  by  a  probability  greater  than  zero. 

For  instance,  the  conditional  probability  of  the  symptom  "sore  elbow" 
given  the  disease  "typhoid  fever"  is  zero  since  a  sore  elbow  is  not  a 
symptom  of  typhoid  fever,  If  the  symptom  "sore  elbow"  were  observed, 
in  the  course  of  a  diagnosis  in  which  typhoid  fever  was  considered  a 
possible  cause  of  all  the  symptoms  observed,  the  posterior  probability 
for  the  disease  typhoid  fever  (as  computer  by  Bayes'  rule)  would  be 
zero,  and  typhoid  fever  would  be  eliminated  from  further  consideration 
even  though  all  other  symptoms  strongly  indicated  the  presence  of  this 
disease.  The  problem  here  is  that  while  a  sore  elbow  is  not  a  symp¬ 
tom  of  typhoid  fever,  a  patient  certainly  can  have  typhoid  fever  and 
a  sore  elbow.  Tnis  is  an  example  of  the  more  general  problem  of 
irrelevant  or  "noise"  symptoms.:  Unless  special  precautions  are  taken, 
such  symptoms  can  eliminate  the  actual  disease  from  consideration 

when  processed  by  the  inference  function. 

A  second  problem  arises  when  symptoms  associated  with  two  or 

more  distinct  diseases  are  observed,  as  when  the  patient  has  more  than 
one  disease.  In  this  case  the  diagnostic  process  must  detect  two  or 
more  patterns  of  symptoms,  rather  than  dismissing  certain  symptoms 
as  irrelevant. 

Gorry's  diagnostic  system  overcomes  these  problems  by  processing 
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a  number  of  symptom  patterns  in  parallel  during  a  diagnosis.  A 
pattern  is  defined  to  consist  of  a  subset  of  the  set  of  symptoms  ob¬ 
served  to  the  current  stage  of  the  diagnosis,  such  that  1)  at  least 
one  disease  exhibits  all  the  symptoms  in  the  pattern  with  a  non-zero 
probability,  and  2)  the  pattern  is  not  a  subset  of  any  other  pattern. 

Each  pattern,  along  with  the  probability  distribution  for  the  diseases 
of  the  patient  given  the  symptoms  oi  the  pattern,  is  included  in  what 
Gorry  calls  a  pattern  stack*,  which  is  maintained  by  the  pattern - 
sorting  function. 

In  addition  to  creating  the  pattern  stack  for  the  initial  set  of  ob¬ 
served  symptoms,  the  pattern-sorting  function  processes  every  new 
symptom  against  the  pattern  stack  and  updates  the  patterns  accord¬ 
ingly.  In  particular,  if  a  new  symptom  is  relevant  to  at  least  one 
disease  already  indicated  by  a  pattern,  the  symptom  is  added  to  the 
pattern  and  its  probability  distribution  is  updated.  If  no  disease 
associated  with  a  pattern  exhibits  the  new  symptom,  no  changes  are 
made  to  either  the  pattern  or  its  probability  distribution.  In  addition, 
after  the  new  symptom  has  been  processed  against  all  patterns  in  the 
pattern  stack,  the  pattern -sorting  function  forms  new  patterns  involving 


Gorry's  use  of  the  term  "stack"  in  this  case  does  not  conform  to  our 
definition  of  the  term.  The  pattern  stack  is  actually  implemented  via 
a  SLIP  list. 
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the  symptom  if  possible. 

Finally,  if  the  inference  function  determines  that  the  probability 
of  a  particular  pattern  is  zero,  the  pattern  sorting  function  eliminates 
the  pattern  and  its  probability  distribution  from  the  pattern  stack. 

In  general  terms,  Gorry’s  three  functions  interact  in  the  fol¬ 
lowing  manner.  The  diagnostic  system  is  provided  with  a  list  of 
symptoms  which  have  been  observed.  The  pattern- sorting  function 
examines  these,  separates  them  into  patterns,  and  sets  aside  for 
later  consideration  those  patterns  which  do  not  appear  to  be  relevant 
to  the  principal  medical  problem  (i.e, ,  the  set  of  most  likely  diseases). 
Next,  the  inference  function,  using  the  a  priori  and  conditional  prob¬ 
abilities  which  constitute  the  "experience"  of  the  diagnostic  system, 
creates  a  probability  distribution  for  the  set  of  diseases  which  exhibit 
the  symptoms  of  the  pattern  under  consideration.  Finally,  the  test 
selection  function  utilizes  the  current  probabilities  of  the  various 
diseases,  the  cost  of  each  test,  and  the  usefulness  of  the  results  of 
each  test  to  select  a  good  test  to  apply  to  the  patient.  (Alternatively, 
the  test  '■"'lection  function  may,  of  course,  suggest  a  final  diagnosis.) 
When  the  results  of  the  test  have  been  obtained,  this  process  is  repeated 
until  a  diagnosis  has  been  obtained. 

6.1.3  The  Data  and  Its  Data  Structures 

Let  us  now  describe  within  the  framework  of  our  data  structure 
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model  the  basic  data  involved  in  the  diagnostic  process.  The  set  of 
data  items,  A0,  may  be  defined  to  consist  of  symptoms,  diseases, 
and  tests.  We  could  also  include  the  costs  of  the  various  tests  and 
the  a  priori  probabilities  of  the  various  diseases  in  the  set  of  data 
items,  but  since  there  is  a  one-to-one  relationship  between  costs 
and  tests  and  between  a  priori  probabilities  and  diseases,  we  choose 
to  include  the  cost  and  the  probabilities  in  each  of  the  descriptions 
(i.e. ,  definitions)  of  the  tests  and  the  diseases,  respectively. 

If  we  let  S°=  {Sjlisl^,**’^}  represent  the  set  of  all  symptoms, 
D°**{d.  |i=l,  2, . . .  n^  the  set  of  diseases,  and  T°={ti  |i=l,  2, . . .  nt}  the 
set  of  possible  tests,  then  a°=S°UD° L/T°. 

The  relations  of  interest  may  now  be  defined  as  follows: 


relevant  to  disease  d^} 


r2  =  {(tp Sj)  I  tj6 T°.  s.eS0,  and  symptom  s, 
is  a  possible  result  of  test  t}) 
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rj)  =  {^Sx>\)  ^  st€S°,  ^D0,  and  the  conditional 


probability  that  symptom  s.  is  present 
given  disease  is  x^} 


where  is  an  element  of  the  set  of  conditional  prob¬ 
abilities  constituting  the  experience  of  the  diagnostic  system. 

For  these  relations  it  is  clear  that  A  =  n  -  a0. 

Note,  however,  that  by  reversing  the  order  of  the  elements 
in  the  ordered  pair  (s^,  dk)  for  the  relations  i?  we  obtain 
A  =  D°  U  T°  c  a0  and  Ii  =  S°  U  T°  c  A0.  It  may  be 
advantageous  to  make  this  switch  in  order  to  reduce  the 
cardinalities  of  a  and  IT}  but  for  the  purposes  of  our  ex¬ 
ample  we  will  assume  the  relations  as  are  originally  defioea. 
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We  have  not  included  the  costs  of  misdiagnosis  in  this  discussion. 
The  reason  for  excluding  these  costs  is  primarily  that  we  choose  to 
consider  the  determination  of  a  representation  for  these  items  as  a 
separate  problem.  To  be  more  specific,  we  feel  that  the  symptoms, 
diseases,  and  tests  are  very  strongly  interrelated  (via  r°lations  r^, 
r0,  and  the  various  r°)  and,  hence,  should  be  considered  together  in 
determining  a  storage  structure  for  their  representation.  On  the 
other  hand,  we  feel  that  the  costs  of  misdiagnosis  are  not  closely 
associated  with  any  of  these  and,  thus,  should  be  considered  alone 
in  determining  a  storage  structure  for  their  representation.  In  other 
words,  we  feel  that  the  costs  of  misdiagnosis  have  a  data  structure 
sufficiently  distinct  from  the  data  structure  for  the  symptoms,  diseases, 
and  tests  to  warrant  separate  consideration.  This  is  then  an  example 
of  a  problem  which  we  wish  to  partition  into  separate  problems  for 
the  purposes  of  determining  storage  structures  for  the  data  involved. 

Note  that  rather  than  actually  determining  a  storage  structure  for 
the  representation  of  the  costs  of  misdiagnosis  (via  our  procedure),  we 
will  assume  simply  that  these  data  are  represented  in  matrix  form. 

Let  us  now  consider  the  actual  diagnostic  problem  which  we  wish 
to  use  as  our  example.  Warner  and  his  associates  [76  ]  conducted  a 
prolonged  study  of  a  number  of  types  of  congential  heart  disease,  and 
as  a  result,  they  developed  a  disease -symptom  probability  matrix  for 
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33  diseases  (including  "normal")  and 50  symptoms*.  The  lists  of 
symptoms  and  diseases  appear  in  Tables  6-1  and  6-2,  and  the  corre¬ 
sponding  probability  matrix  appears  in  Table  6-3.  Finally,  the  list 
of  tests  and  their  respective  possible  results  appears  in  Table  6-4. 

In  an  attempt  to  generalize  this  problem  somewhat  we  will  map 
all  probabilities  of  the  disease -symptom  probability  matrix  which  lie 
within  the  range  x  -  0. 025  to  x  +  0. 025  to  the  value  x,  where  x  is  some 
multiple  of  0.05.  That  is,  to  every  probability  y  which  satisfies  the 
condition  x  -  0. 025  <  y  <  x  +  0. 025  we  will  assign  a  new  value  x.  For 
example,  0.06  and  0.07  will  become  0.05,  and  0.08  and  0.09  will 
become  0. 10. 

Most  of  the  probabilities  in  the  matrix  are  already  multiples  of 
0. 05  so  this  has  little  effect  upon  the  actual  values  therein  (or  their 
apparent  validity).  There  are,  however,  a  large  number  of  entries 
with  the  values  0.01  and  0.02,  and  these  will  be  mapped  to  0.  This 
should  not  adversely  affect  the  diagnostic  capability  of  the  system  and 
serves  only  to  reduce  the  number  of  diseases  which  reflect  a  given 
symptom.  It  is  hoped  that  the  resultant  interrelationships  more 
accurately  reflect  those  of  a  more  general  diagnostic  problem,  that  is, 

* 

In  his  consideration  of  this  same  problem  Gorry  apparently  had 
access  to  Warner's  updated  matrix,  for  he  considers  35  diseases  and 
57  attributes.  The  differences  should  be  relatively  slight,  however. 
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Symptom 


Interpretation 


|^S07 

508 

509 

510 

511 

512 

513 
SI  4 
SI  5 
S16 

[S17 

S18 

[si9 

[S20 


IS21 

S22 

^S23 

524 

525 


Age,  less  than  1  year 
Age,  1  year  to  20  years 
Age,  20  years  or  more 
Cyanosis,  mild 

Cyanosis,  severe  (with  clubbing) 

Cyanosis,  intermittent 

Cyanosis,  differential 

Squatting 

Dyspnea 

Easy  fatigue 

Orthopnea 

Chest  pain 

Repeated  respiratory  infections 
Syncope 

Systolic  murmur,  loudest  at  apex 

Diastolic  murmur,  loudest  at  apex 

Systolic  murmur,  loudest  L  4th 

Diastolic  murmur,  loudest  L  4th 

Continuous  murmur,  loudest  L  4th 

Systolic  murmur, with  thrill,  loudest 
L  2nd 

Systolic  murmur  without  thrill,  loudest 
L  2nd 

Diastolic  murmur,  loudest  L  2nd 
Continuous  murmur,  loudest  l  2nd 
Systolic  murmur,  loudest  d  2nd 
Diastolic  murmur,  loudest  R  2nd 


Table  6-1 »  Symptoms  for  Congenital  Heart  Disease 
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Symptom 


Interpretation 


1  S47 

548 

549 


S50 


Systolic  murmur  heard  best  posterior  chest 
Continuous  murmur  heard  best  posterior  chest 
Accentuated  2nd  heart  sound,  L  2nd 
Diminished  2nd  heart  sound,  L  2nd 
Right  ventricular  hyperactivity  by  palpation 
Forceful  apical  thrust 
Pulsatile  liver 

Absent  or  diminished  femoral  pulsation 

ECG  axis  more  than  110° 

ECG  axis  less  than  0° 

R  wave  greater  than  1„  2  mv  in  lead 

R*  or  qR  pattern  in  lead 

R  wave  greater  than  2.0  mv  in  lead  Vg 

T  wave  in  lead  Vg  inverted  (no  digitalis) 

Early  diastolic  murmur,  loudest  at  apex 

Late  diastolic  murmur,  loudest  at  apex 

Holo-systolic  murmur,  loudest  L  4th 

Mid-systolic  murmur,  loudest  L  4th 

Holo-diastolic  murmur,  loudest  L  4th 

Early -diastolic  murmur,  loudest  L  4th 

Mid-systolic  murmur  with  thrill,  Rudest 
L  2nd 

Holo-systolic  murmur  without  thrill, 
loudest  L  2nd 

Mid-systolic  murmur  without  thrill, 
loudest  L  2nd 

Holo-systolic  murmur  without  thrill, 
loudest  L  2nd 

Murmur  louder  than  gr  3/6 


Brackets  indicate  mutually  exclusive  symptoms. 

Table  6-1.  Symptoms  for  Congenital  Heart  Disease  (Cont.) 
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Disease 


Interpretation 


D01 
DO  2 
D03 
DO  4 

D05 

D06 

D07 

D08 

D09 

DIO 

Dll 

D12 

D13 

D14 

D15 

D16 

D17 

D18 

D19 

D20 

D21 

D22 

D23 

D24 

D25 


Normal 

Atrial  septal  defect 

Atrial  septal  defect  with  pulmonary  stenosis 

Atrial  septal  defect  with  pulomonary  hyper¬ 
tension 

Complete  endocardial  cushion  defect 

Partial  anomalous  pulmonary  venous 
connections 

Total  anomalous  pulmonary  venous  connections 
Tricuspid  atresia  without  transposition 
Ebstein’s  anomaly 

Ventricular  septal  defect  with  valvular 
pulmonary  stenosis 

Ventricular  septal  ,  feet  with  infundibular 
stenosis 

Pulmonary  stenosis,  valvular 

Pulmonary  stenosis,  infundibular 

Pulmonary  atresia 

Pulmonary  artery  stenosis 

Pulmonary  hypertension 

Aortic -pulmonary  window 

Patent  ductus  arteriosus 

Pulmonary  arteriovenous  fistula 

Mitral  stenosis 

Primary  myocardial  disease 

Anomalous  origin  of  left  coronary  artery 

Aortic  valvular  stenosis 

Subaortic  stenosis 

Coarctation  of  aorta 


Table  6-2.  Heart  Disease  Types 
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•  "■'»*** 


Disease 


Interpretation 


D26 

D27 

D28 

D29 

D30 

D31 

D32 

D33 


Table  6-2.  Heart  Disease  Types  (Cont.) 


Truneus  arteriosus 

Transposed  great  vessels 

Corrected  transposition 

Absent  aortic  arch 

Ventricular  septal  defect 

Ventricular  septal  defect  with  pulmonary 
hypertension 

Patent  ductus  arteriosus  with  pulmonary 
hypertension 

Tricuspid  atresia  with  transposition 
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SYMPTOMS 


Diseases 

incidences 

D01 

0.100 

D02 

.081 

D03 

.005 

DO  4 

.001 

D05 

.027 

D06 

.005 

DO  7 

.001 

D08 

.018 

D09 

.001 

DIO 

.054 

Dll 

.063 

D12 

.045 

D13 

.013 

D14 

.014 

D15 

.001 

D16 

.013 

D17 

.001 

D18 

.072 

D19 

.002 

D20 

.008 

D21 

.013 

D22 

.001 

D23 

.038 

D24 

.009 

D25 

.054 

D26 

.005 

D27 

.063 

D28 

.001 

D29 

.001 

D30 

.252 

D31 

.081 

D32 

.005 

D33 

.009 

SOI 

SO  2 

S03 

S04 

01 

49 

50 

01 

10 

50 

50 

02 

30 

60 

10 

20 

10 

20 

70 

30 

20 

50 

30 

15 

10 

40 

50 

01 

20 

70 

10 

65 

50 

48 

02 

30 

10 

45 

45 

22 

40 

55 

05 

25 

40 

55 

05 

30 

20 

70 

10 

01 

20 

70 

10 

01 

90 

09 

01 

10 

05 

45 

50 

01 

10 

45 

45 

01 

30 

60 

10 

05 

20 

40 

40 

01 

20 

30 

50 

45 

2C 

50 

30 

01 

70 

29 

01 

01 

70 

29 

01 

01 

10 

80 

10 

01 

10 

80 

10 

01 

10 

70 

20 

01 

50 

40 

10 

30 

90 

10 

00 

20 

30 

30 

30 

30 

60 

39 

01 

01 

15 

70 

15 

01 

30 

60 

10 

30 

30 

40 

30 

01 

40 

55 

C5 

50 

S05 

SO  6 

S07 

S08 

00 

01 

00 

01 

01 

02 

00 

01 

10 

20 

00 

01 

10 

25 

00 

01 

05 

10 

00 

01 

01 

01 

00 

Oi 

10 

05 

00 

01 

65 

01 

00 

10 

44 

01 

00 

22 

25 

10 

00 

30 

30 

10 

00 

40 

01 

01 

00 

01 

01 

00 

00 

01 

90 

00 

00 

80 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

CO 

01 

45 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

01 

01 

00 

01 

60 

01 

00 

15 

60 

05 

10 

05 

05 

10 

00 

01 

01 

01 

80 

30 

01 

01 

00 

01 

50 

10 

00 

05 

01 

05 

50 

01 

20 

10 

00 

01 

Table  6-c$.  Symptom -Disease  Probability  Matrix 
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Diseases 

S09 

S10 

Sll 

D01 

01 

10 

03 

D02 

35 

50 

05 

DO  3 

60 

70 

05 

DO  4 

80 

90 

05 

DO  5 

40 

50 

05 

DO  6 

15 

20 

01 

D07 

70 

80 

05 

D08 

80 

90 

20 

D09 

80 

80 

10 

DIO 

75 

90 

05 

Dll 

75 

90 

05 

D12 

50 

65 

01 

D13 

50 

65 

01 

D14 

90 

99 

05 

D15 

01 

01 

01 

D16 

70 

95 

40 

D17 

10 

10 

05 

D18 

20 

20 

10 

D19 

10 

20 

05 

D20 

50 

50 

40 

D21 

40 

50 

20 

D22 

30 

30 

30 

D23 

20 

30 

20 

D24 

20 

30 

20 

D25 

20 

30 

20 

D26 

15 

30 

05 

D27 

60 

70 

20 

D28 

10 

20 

01 

D29 

10 

50 

05 

D30 

20 

30 

05 

D31 

60 

70 

20 

D32 

20 

30 

10 

D33 

80 

90 

20 

S12  S13  S14  S15  SI 6 

05  05  03  05  01 

02  40  01  02  02 

02  10  10  02  02 

05  15  10  0Z  02 

05  30  05  60  15 

05  05  01  02  02 

05  20  05  02  02 

05  15  10  02  05 

30  15  22  05  25 

05  10  20  02  02 

05  10  25  02  02 

01  01  10  02  02 

01  01  10  02  02 

10  05  35  02  02 

01  01  01  04  01 

10  10  10  01  01 

01  10  01  05  10 

01  10  05  05  15 

01  01  10  05  02 

05  10  10  80  20 

01  05  05  15  02 

80  15  20  05  01 

15  01  35  20  02 

15  01  35  20  02 

01  01  05  05  01 

01  20  10  02  02 

01  05  10  05  02 

01  01  01  05  02 

20  01  20  05  02 

01  15  05  05  20 

10  20  10  05  01 

01  10  05  0?  02 

01  30  05  05  10 


Table  6-3.  Symptom -Direase  Probability 
Matrix  (Cont.) 


Diseases  S17  S18  S19  S20  S21  S22 


D01  70  02  07  00  80  01 

D02  30  20  02  05  90  02 

D03  05  05  02  57  40  01 

D04  15  20  02  05  40  20 

D05  90  40  02  10  20  10 

DO  6  20  02  02  02  60  05 

D07  10  15  10  05  75  05 

D08  6b  05  05  20  20  02 

D09  95  25  05  05  15  02 

DIO  20  02  05  65  25  02 

Dll  20  02  05  65  25  02 

D12  10  02  05  70  20  02 

D13  10  02  02  70  20  02 

D14  40  05  05  01  02  02 

D15  02  01  01  02  ^5  02 

D16  30  05  01  01  05  30 

D17  20  05  60  01  10  05 

D18  10  02  50  02  13  05 

D19  10  02  20  02  10  02 

D20  10  10  02  05  10  02 

D21  05  02  02  02  05  02 

D22  01  01  01  01  01  01 

D23  20  10  02  05  05  01 

D24  20  10  02  05  05  01 

D25  20  10  02  02  10  01 

D26  70  02  02  10  10  02 

D27  50  02  02  03  10  02 

D28  70  02  02  05  30  02 

D29  50  02  02  10  30  02 

D30  95  05  02  10  10  05 

D31  50  10  02  05  C5  25 

D32  10  10  02  02  20  10 

D33  70  05  02  10  30  10 


Table  6-3.  Symptom -Disease  Probability 
Matrix  (Cont.) 


Diseases 

S23 

S24 

S25 

S26 

S27 

S28 

S29 

S30 

D01 

05 

01 

00 

01 

01 

15 

05 

10 

DO  2 

02 

01 

01 

01 

01 

80 

01 

80 

DO  3 

03 

01 

01 

01 

02 

30 

15 

40 

D04 

01 

01 

01 

01 

01 

95 

01 

50 

D05 

01 

01 

01 

01 

01 

70 

02 

40 

DO  6 

05 

01 

01 

10 

15 

40 

02 

10 

D07 

20 

01 

01 

10 

15 

85 

02 

80 

D08 

05 

01 

01 

01 

01 

02 

60 

01 

D09 

05 

01 

01 

01 

01 

02 

35 

10 

DIO 

05 

02 

02 

10 

15 

10 

60 

20 

Dll 

05 

02 

02 

10 

15 

10 

60 

20 

D12 

10 

02 

02 

01 

01 

10 

60 

20 

D13 

02 

02 

02 

01 

01 

10 

60 

20 

D14 

05 

02 

02 

10 

10 

01 

90 

20 

D15 

01 

20 

02 

50 

05 

10 

02 

10 

D16 

02 

02 

02 

02 

02 

95 

00 

30 

D17 

20 

02 

02 

02 

02 

70 

01 

20 

D18 

85 

02 

02 

03 

05 

50 

01 

20 

D19 

05 

01 

01 

05 

70 

OF 

05 

20 

D20 

02 

02 

02 

01 

01 

5  J 

01 

20 

D21 

02 

10 

02 

01 

01 

20 

02 

10 

D22 

01 

01 

01 

Cl 

01 

20 

02 

01 

D23 

01 

95 

05 

01 

01 

20 

10 

01 

D24 

01 

95 

05 

01 

01 

20 

10 

01 

D25 

05 

15 

10 

80 

15 

10 

10 

01 

D26 

02 

02 

02 

05 

10 

40 

10 

30 

D27 

02 

05 

02 

01 

01 

20 

10 

20 

D28 

02 

05 

02 

01 

01 

20 

10 

10 

D29 

02 

05 

02 

01 

01 

90 

0? 

40 

D30 

01 

02 

05 

01 

01 

30 

02 

05 

D31 

01 

02 

05 

01 

01 

90 

02 

30 

D32 

02 

02 

02 

02 

02 

90 

02 

30 

D33 

02  02 

Table  6-3. 

01  01  01  30  10 

Symptom -Disease  Probability 

01 

Matrix  (Cont.) 
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Diseases  S31  S32  S33  S34  S35  S36  S37 

D01  03  01  01  01  02  02  02 

D02  01  01  01  70  05  05  85 

D03  01  05  01  85  05  20  70 

D04  01  05  01  85  05  20  70 

D05  10  10  01  05  70  05  85 

D06  01  01  01  15  02  02  15 

D07  01  01  01  90  02  25  75 

D03  20  30  01  02  90  02  02 

D09  20  10  01  10  02  02  60 

DIO  01  02  01  95  02  85  10 

Dll  01  02  01  95  02  85  10 

D12  01  05  01  95  02  85  10 

D13  01  05  01  95  02  85  10 

D14  01  02  01  95  02  85  10 

D15  01  01  01  10  02  10  02 

D16  01  10  01  95  02  90  05 

D17  40  01  01  01  15  02  02 

D18  40  C2  01  02  10  02  02 

D19  oi  01  01  05  05  02  02 

D20  05  02  01  50  02  10  40 

D21  50  02  01  05  10  05  05 

D22  05  01  01  05  10  05  05 

D23  40  01  05  05  15  02  02 

D24  40  01  05  05  15  02  02 

D25  30  01  99  05  05  02  02 

D26  05  01  01  30  10  40  10 

D27  20  02  02  40  20  30  05 

D28  io  01  01  20  10  10  10 

D29  05  01  10  70  05  80  05 

D30  30  01  01  30  10  05  05 

D31  05  05  01  70  05  75  15 

D32  05  05  01  70  05  75  15 

D33  20  30  01  02  90  02  02 


Table  6-3.  Symptom-Disease  Probability 
Matrix  (Cont. ) 
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Diseases 


D01 
D02 
DO  3 
DO  4 
D05 
D06 
D07 
D08 
D09 

DIO 

Dll 

D12 

D13 

D14 

D15 

D16 

D17 

D18 

D19 

D20 

D21 

D22 

D23 

D24 

D25 

D26 

D27 

D28 

D29 

D30 

D31 

D32 

D33 


S3  8 

S39 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

90 

10 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

02 

60 

05 

50 

05 

02 

02 

02 

02 

40 

90 

20 

90 

70 

15 

70 

15 

40 

04 

20 

05 

20 

05 

10 

10 

10 

05 

15 

05 

10 

05 

10 

05 

90 

10 

Table  6-3. 


S40  S41 


01  00 

01  02 

01  01 

01  02 

15  01 

02  02 

02  02 

05  02 

25  25 

02  02 

02  02 

01  01 

01  01 

02  01 

01  01 

01  01 

10  02 

10  02 

02  02 

20  20 

02  02 

01  01 

02  02 

02  02 

01  01 

02  02 

02  02 

02  02 

02  02 

20  02 

01  01 

02  02 

10  02 


S42  S43 


02  70 

01  30 

01  05 

01  15 

85  05 

02  20 
30  10 

50  15 

45  45 

20  05 

20  05 

01  10 
01  io 

30  40 

02  02 
01  30 

10  20 
05  10 

10  10 
10  10 
10  10 
01  01 
02  20 
02  20 
05  20 
40  40 

30  30 

30  30 

30  30 

92  05 

30  30 

10  10 
30  30 


S44 

S45 

04 

03 

02 

20 

01 

05 

20 

02 

02 

20 

02 

02 

01 

30 

05 

02 

25 

25 

02 

02 

02 

02 

02 

02 

01 

01 

02 

05 

01 

00 

15 

05 

05 

02 

02 

02 

0? 

02 

10 

10 

02 

02 

01 

01 

10 

02 

10 

02 

10 

02 

02 

02 

02 

02 

02 

02 

02 

02 

05 

01 

10 

02 

02 

02 

05 

05 

Symptom -Disease  Probability 
Matrix  (Cont. ) 
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Diseases  S46 


S47  S48  S49  S50 


D01 

00 

00 

80 

05 

10 

D02 

05 

01 

90 

01 

60 

DO  3 

60 

01 

38 

01 

70 

DO  4 

05 

01 

40 

01 

40 

DO  5 

02 

20 

20 

•20 

80 

DO  6 

02 

02 

60 

02 

30 

D07 

05 

01 

80 

02 

70 

DO  8 

20 

20 

20 

20 

50 

D09 

15 

15 

05 

05 

50 

DIO 

60 

05 

25 

05 

90 

Dll 

60 

05 

25 

05 

90 

D12 

68 

01 

25 

01 

80 

D13 

68 

01 

25 

01 

80 

D14 

01 

01 

02 

02 

20 

D15 

02 

01 

25 

02 

60 

D16 

02 

02 

05 

02 

20 

D17 

02 

02 

10 

05 

75 

D18 

05 

02 

20 

10 

85 

D19 

02 

02 

10 

10 

30 

D20 

05 

05 

10 

10 

70 

D21 

02 

02 

05 

05 

10 

D22 

01 

01 

01 

01 

10 

D23 

05 

01 

05 

01 

90 

D24 

05 

01 

05 

01 

90 

D25 

02 

02 

10 

05 

65 

D26 

10 

10 

10 

10 

40 

D27 

03 

03 

10 

10 

50 

D28 

05 

05 

30 

30 

60 

D29 

10 

10 

30 

30 

20 

D30 

01 

10 

01 

10 

85 

D31 

01 

05 

01 

05 

50 

D32 

02 

02 

20 

20 

20 

D33 

10 

10 

30 

30 

50 

Table  6-3.  Symptom -Disease  Probability 
Matrix  (Cont.) 
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Test 

Possible  Results 

T01 

SOI,  S02,  S03 

TO  2 

S04,  S05,  S06,  S07,  N 

T03 

S08,  N 

T04 

S09,  N 

T05 

S10,  N 

T06 

Sll,  N 

T07 

SI  2,  N 

T08 

S13,  N 

TOO 

S14,  N 

T10 

SI  5.  N 

TU 

S16,  N 

T12 

S17,  S18,  S19,  N 

T13 

S20,  S21,  S22,  S23,  N 

T14 

S24,  N 

T15 

S25,  N 

T16 

S26,  S27,  N 

T17 

S28,  S29,  N 

T18 

S30,  N 

T19 

S31,  N 

T20 

S32,  N 

T21 

S33,  N 

T22 

S34,  S35,  N 

T23 

S36,  S37,  N 

T24 

S38,  N 

T25 

S39,  N 

T26 

S40,  S41,  N 

T27 

S42,  S43,  N 

T28 

S44,  S45,  N 

T29 

S46,  S47,  S48,  S49,  N 

T30 

S50,  N 

N  means  "normal" 

Table  6-4. 

Tests  for  Heart  Disease  Diagnosis 
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L 


one  which  encompasses  a  greater  variety  of  diseases  and  symptoms. 

Using  this  new  probability  matrix  along  with  the  numbers  of 
symptoms  ,  diseases,  and  tests,  we  will  determine  values  for  the 
parameters  which  characterize  our  data  structure  model:  namely, 

ka’  kp’  kr'  mr  ’  311(1  k°l* * *k  10* 

P 

Since  IS°1  =50,  ID0 1  =33,  and  lT°I  =30,  we  know  that 
k  a  =  k d  =  From  tiie  probability  matrix  we  see  that  there  are 

n>  r 

20  relations  r9 :  r°  for  0.05,  r^for  0.10,**  • ,  and  r°Q  for  1.00. 
Therefore,  k°  =  22. 

For  each  "type”  of  relation  (i.e. ,  r°,  r^,  and  r g)  we  have  compiled 
certain  statistics  which  appear  in  Table  6-5.  If  the  totals  for  items 
1  and  2  and  those  for  items  3  and  4  are  divided  by  the  number  of 
sources  and  the  number  of  targets,  respectively,  and  if  the  number 
of  sources  and  the  number  of  targets  are  divided  by  the  totals  for 
items  5  and  6,  respectively,  we  obtain  the  averages  contained  in 
Table  6-6. 

We  can  also  determine  from  inspection  of  our  given  information  that 

o 

there  are  no  identical  (shared)  target  sets  (II  's)  and  no  two  sources 
have  a  relation/target  set  pair  in  common.  These  two  facts  imply 
that  k°  =  1  and  k°  =  1,  respectively.  The  fact  that  k°  =  1  implies  that 
k °  =  1,  also. 

Items  5  and  6  of  Table  6-6  correspond  to  k°  and  k°Q,  respectively. 
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Table  6-5.  Statistics  for  Heart  Disease  Problem 


14.36 


1)  Average  number  of  relation/ 
target  pairs  associated  with 
a  given  source 


2)  Average  number  of  distinct  3.50 

relations  associated  with  a 

given  source 

3)  Average  number  of  source/  14.36 

relation  pairs  associated 

with  a  given  target 


4)  Average  number  of  distinct  3. 60 

relations  associated  with  a 
given  target 


5)  Average  number  of  sources  1.00 

in  a  given  source  subset 

6)  Average  number  of  targets  1.30 

in  a  given  target  subset 


Table  6-6.  Averages  for  Heart  Disease  Problem 
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Therefore,  k^  =  1  and  k°g  =  l.°0.  Item  2  of  Table  6-6  corresponds 

and  since  k°  =  1 .  we  find  that  k°.  =  3. 50.  Item  1 
2  >  4 

represents  the  product  k^k^k^k^k®  and  since  we  know  the  values  of 
all  these  factors  except  k° ,  we  can  determine  that  k°  =  3.13.  Simi¬ 
larly,  item  3  represents  the  product  k°k°k gk^kg  from  which  we 
determine  that  k°  =  14. 36. 

This  leaves  us  with  only  m  remaining  to  be  determined.  Recall 

rp 

that  m  represents  the  expected  number  of  times  the  same  relation 

rp 

symbol  is  associated  with  a  given  target.  Since  k^k^  =14. 36,  we 

know  that  associated  with  each  target  there  are  on  the  average  14.36 

relations  which  are  not  necessarily  distinct.  Item  4  of  Table  6-6 

indicates,  however,  that  there  are  an  average  of  only  3.  60  distinct 

relations  associated  with  a  given  target.  Hence,  mr  =14. 36/3. 60=3.99. 

P 

For  convenience  all  of  these  results  are  summarized  in  Table  6-7. 
Since  our  prototype  program  uses  fixed-point  arithmetic  in  most 
of  the  low-level  routines,  we  require  that  the  information  of  Table  6-7 
be  in  integer  form.  This  implies  transformation  of  these  results  to 
integer  values  approximating  the  original  values  and  satisfying  the 
given  constraints.  (Clearly,  we  may  not  simply  round  off  the  values.) 
Table  6-8  contains  one  possible  choice  for  the  transformed  values. 


to  the  product  k^k^ 
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k°  = 
a 

113 

k0p  = 

113 

ko  = 
r 

22 

m  = 
r 

P 

3.99 

ki  = 

1 

k°2  = 

1 

ii 

& 

1 

k°  - 

3.50 

k  7 

4 

n  = 

3.13 

k°  - 
9  " 

14.36 

k°  - 
10 

1.30 

Table  6-7.  Summary  of  Parameter  Values  for  Heart  Disease  Problem 
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Table  6- 


k°  =  113 

a 

k°  =  113 

P 

k°  =  22 


m  =  4 

r 


k°  =  3 

4 


“°7  = 
k°8  = 


k°  = 
10 


1 

3 

18 

2 


Transformed  Parameters  for 
Heart  Disease  Problem 
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6. 1.  4  The  Operations 


We  are  now  concerned  with  estimation  of  the  relative  frequencies 
of  the  operations  to  be  used  in  the  diagnostic  process. 

In  pattern  sorting  two  basic  operations  appear:  namely,  (1) 
determine  the  diseases  associated  with  a  given  symptom,  and  (2) 
determine  the  symptoms  associated  with  a  given  disease.  The  first 
of  these  is  used  to  determine  those  diseases  which  may  be  indicated 
by  the  set  of  symptoms  in  a  given  pattern,  and  the  second  is  then  used 
to  determine  the  intersection  of  the  symptoms  associated  with  each  of 
these  diseases  with  the  set  of  given  symptoms.  (Note  that  if  each  time 
the  first  operation  is  performed  the  symptom  leading  to  a  particular 
disease  is  recorded  for  that  disease,  then  there  is  no  need  for  the 
second  operation.) 

Gorry  indicates  that  his  program,  operating  in  the  sequential 
diagnosis  mode,  was  given  on  the  average  7  initial  symptoms  and  per¬ 
formed  an  average  of  5. 8  tests  (each  of  which  yields  one  symptom  as  a 
result).  We  may,  therefore,  assume  that  the  first  operation  above  is 
performed  on  the  average  12.8  times  in  the  course  of  a  solution. 

Since  each  symptom  can  indicate  on  the  average  18. 7  diseases 
(from  the  symptom-disease  matrix),  we  can  probably  assume  that  most, 
if  not  all,  diseases  must  be  considered  for  the  seconi  operation.  We 
will  assume  conservatively  '.hat  25  diseases  are  involved  and,  hence, 
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that  the  second  operation  io  performed  25  times  in  the  course  of  a 
solution. 

These  two  operations  correspond  to  our  primitive  operations 
^12^i* 8,1(1  Qi4(-*Pk),  respectively.  Thus,  QJ2  is  performed 
on  the  average  12.8  times  and  Q^,  25  times. 

The  inference  process  (i.  e. ,  the  applicat.'  on  of  Bayes*  rule)  also 
involves  two  basic  operations:  (1)  determine  the  a  priori  probability 
of  a  given  disease,  and  (2)  determine  the  conditional  probability  that 
a  particular  symptom  is  associated  with  a  given  disease.  Since  we 
have  assumed  that  the  a  priori  probability  of  a  given  disease  is  part 
of  the  disease  description  itself  (and,  hence,  is  contained  within  the 
disease  description  block),  we  need  not  concern  ourselves  with  the 
first  operation  as  it  cannot  affect  the  structure. 

The  inference  function  processes  each  observed  symptom  against 
every  disease  in  the  pattern  stack.  If  the  observed  svmptom  is  relevant 
to  at  least  one  disease  associated  with  a  given  pattern,  then  Bayes’ 
rule  is  applied  to  each  disease  associated  with  the  pattern.  The  second 
operation  must  therefore  be  applied  once  ior  every  such  disease  and 
the  given  observed  symptom. 

Since  we  have  no  way  of  validly  estimating  the  average  number  of 
patterns  in  the  pattern  stack  (although  Gorry  does  assure  us  that  this 
number  should  ordinarily  be  quite  1  i r  ited)  but  we  do  know  that  the 


process  should  eventually  result  in  only  one  pattern  (unless  the  patient 
has  two  or  more  unrelated  diseases,  which  we  assume  he  does  not), 
we  will  assume  that  the  pattern  stack  always  contains  only  one  pattern. 
Our  next  problem  is  to  determine  the  number  oi  diseases  associated 
with  the  pattern  on  the  average.  Since  each  symptom  can  indicate  an 
average  of  18. 7  diseases,  we  will  assume  that  the  initial  pattern  con¬ 
sisting  of  the  first  observed  symptoms  has  that  many  diseases  asso¬ 
ciated  with  it.  At  the  end  of  ch ;  diagnostic  process,  Gorry’s  statistics 
for  this  problem  show  an  average  of  4. 1  diseases  having  probabilities 
greater  than  a  given  threshold  (0. 01).  Let  us  assume  then  that  the 
pattern  remaining  at  the  end  of  the  diagnosis  has  4. 1  diseases  asso¬ 
ciated  with  it.  At  the  risk  of  offending  the  purist  let  us  assume  that 
the  average  number  of  diseases  associated  with  a  pattern  throughout 
the  course  of  the  diagnosis  is  the  average  of  the  average  numbers  for 
the  initial  and  final  patterns,  which  is  11.4. 

Since  on  the  average  12.8  symptoms  are  considered  in  the  course 
of  the  diagnosis,  we  can  say  that  the  second  operation,  which  corre¬ 
sponds  to  primitive  operation  0^ (d^^-p^) ,  is  performed  (1 2. 8)(11 . 4)  = 
145. 9  times. 

The  test  s  election  function  determines  the  set  of  tests  which  are 
relevant  to  the  diseases  whose  posterior  probabilities  at  the  current 
decision  node  each  excerd  a  given  threshold  (i.e. ,  the  most  likely 
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diseases  so  far).  Excluded  are  those  tests  which  may  already  have 
been  considered.  Then  for  each  possible  result  of  each  test  the  in¬ 
ference  function  is  invoked.  The  resulting  probabilities  along  with 
the  costs  of  tests  and  the  loss  function  matrix  are  used  to  determine 
which  test  should  actually  be  performed. 

We  see  then  that  test  selection  involves  the  following  basic  opera¬ 
tions:  (1)  determine  the  tests  relevant  to  the  given  diseases,  (2) 
determine  the  possible  outcomes  of  each  test,  (3)  detei  mine  the  cost 
of  each  test,  (4)  determine  certain  elements  of  the  loss  matrix,  and 
the  inference  function  basic  operations. 

Since  we  have  assumed  that  the  cost  of  a  given  test  is  part  of  the 
test  description,  we  need  not  concern  ourselves  with  its  determination. 
Since  the  loss  function  matrix  is  not  part  of  the  structure  under  con¬ 
sideration  (it  is  a  matrix),  we  may  ignore  determination  of  its  elements 
also.  Thus,  we  need  concern  ourselves  only  with  determining  the  tests 
and  tneir  outcomes  and  the  conditional  probabilities  that  particular 
symptoms  are  associated  with  given  disease. 

Gorry’s  results  indicate  teat  for  this  problem  an  average  of  350 
decision  nodes  are  encountered  in  the  solution  process.  Let  us  assume 
as  we  did  in  considering  the  inference  process  that  there  are  an  aver¬ 
age  of  11. 4  diseases  of  interest  at  each  decision  node.  Let  us 
further  assume  that  only  3  of  these  diseases  have  posterior  probabilities 
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greater  than  the  given  threshold.  (It  would  be  unlikely  that  all  the 
diseases  are  equally  probable.)  From  Tables  6-3  and  6-4  we  can 
determine  that  there  are  an  average  of  19. 3  tests  which  are  pertinent 
to  each  disease  and  each  test  has  an  average  of  1. 7  possible  outcomes 
(symptoms)  other  than  "normal”. 

Then  the  first  of  the  test  selection  operations  will  be  performed 
3  times  at  each  decision  node,  or  a  total  of  1050  times. 

Although  each  disease  has  19.3  tests  associated  with  it  on  the 
average,  there  may  be  considerable  overlap  between  the  sets  of  tests 
for  two  different  diseases.  Let  us  assume  that  the  3  diseases  of  interest 
at  each  decision  node  have  22  distinct  tests  associated  with  them  out 
of  the  30  possible  tests.  Since  some  of  these  may  already  have  been 
considered  (by  the  end  of  the  solution  process  a  total  of  12. 8  will  have 
been  considered),  let  us  assume  that  15  remain  to  be  considered  at  any 
given  node.  Then  the  second  of  the  test  selection  operations  will  be 
performed  15  times  for  each  decision  node,  or  a  total  of  5250  times. 

Note  that  these  two  operations  are  both  of  the  form  of  primitive 

ooeration  Qq  (d.r,-).  Therefore,  Q„will  be  performed  a  total  of  6300 
o  l  3  o 

times  during  the  solution  process. 

Since  each  test  has  an  average  of  1. 7  outcomes  there  will  be  on 
the  average  (1.  7)  (15)  =  25  symptoms  to  be  considered  at  each  decision 
node,  which  implies  that  operation  (of  the  inference  function)  mast 
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be  performed  75  times  at  each  node,  or  a  total  of  26, 250  times. 
will  then  be  performed  a  grand  total  of  26,396  times. 

To  summarize,  the  following  primitive  operations  are  involved 
in  the  solution  process  with  the  indicated  (rounded-off)  absolute  and 


relative  frequencies: 

Relative 

Operation 

Frequency 

Frequency 

9j  <dirr> 

6300 

0.192 

q7  «v*> 

26396 

0.806 

13 

0.001 

v-v 

25 

0.001 

32734 


6. 1. 5  The  Environment 


As  a  final  step  before  submitting  oi  :  problem  to  the  solution 
process  and  determining  the  optimal  storage  structure  for  it,  we 
must  assign  values  to  the  unspecified  parameters  and  the  external 
decision  variables. 

As  for  the  external  decision  variables,  let  us  assume  cr^-  0, 

ct2=  1,  Pj  =  0,  pg=  1,  and  Pg  =  0.  The  choice  of  these  values  is  largely 

an  arbitrary  one,  but  it  does  have  an  intuitive  justification.  Since  the 

sets  A  and  n  are  not  disjoint  ,  we  must  handle  the  "wrap-around" 

problem  by  either  requiring  at  least  one  of  and  to  be  non-zero 

or  requiring  some  0.  to  je  non-zero.  Rather  than  constrain  the  p.'s 

1  1 
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in  this  manner,  we  choose  to  set  a g=  1.  Since  for  the  problem  under 
consideration  there  appears  to  be  no  advantage  to  be  gained  from 
setting  both  and  to  one,  we  choose  to  set  o^  =  0. 

The  values  assigned  to  p^,  pg,  and  p^  are  simply  the  ’’natural'’ 
choices.  A  relation  symbol  name  appears  only  once  for  each  use 
when  p^  =  0,  pg=  1  and  Pg=  0,  and  it  is  equally  accessible  from  both 
sources  and  targets. 

Consider  now  the  various  unspecified  parameters.  First  let  us 
assume  that  our  structure  is  uniform  in  nature.  That  is,  let  us  assume 
that  all  pointers  in  the  structure  occupy  the  same  size  fields  and  that 
they  all  require  the  fame  number  of  time  units  to  follow;  let  us  assume 
that  all  stepping  operations  for  stacks  require  the  same  number  of  time 
units  for  execution;  and  so  forth.  More  specifically,  let  the  following 
conditions  be  true: 


L  =  ...-r  =h.  =•••  =h1n=f  =f  =F  =F  -F 
1  10  1  lQaparp 


Vsi  ="'  =  sio 


c,  =c  and  v,  =v 
dr  dr 


T  =  T  =  T 
a  r  p 


u  =  u 
a  p 


s  =  s  ,  =  s,  =  •  ° • 
P  d  U 


~  Sf  s  h 

f10  1 


259 


For  further  convenience  let  us  also  assume  that  T  ,  T  ,  and  T 

a’  r*  p 

represent  simply  the  times  required  to  follow  pointers  and,  hence,  are 
equal  to  Fft,  Fr,  F  ,  etc. 

Not  included  in  the  conditions  above,  but  a’ so  in  need  of  values 
are  the  parameters  u^,  ur,sr,st,  and  SQ. 

In  order  to  rigorously  specify  the  environment  (i.e. ,  the  computer) 
in  which  the  diagnostic  system  is  to  reside  and  operate,  we  will  assume 
it  to  be  an  IBM  360/67,  with  which  we  assume  the  reader  to  be  familiar. 
With  this  in  mind  let  us  briefly  examine  the  characteristics  of  each  of 
the  quantities  to  when  we  must  assign  a  value. 

The  act  of  following  a  pointer  simply  involves  replacing  the  value 
of  the  position  indicator  (described  in  Chapter  IV)  by  the  contents  of 
the  field  to  which  that  value  points,  where  this  field  is  assumed,  of 
course,  to  contain  a  pointer.  By  letting  the  position  indicator  occupy 
one  of  the  general  registers  of  the  360/67,  this  operation  can  be  imple- 
meted  via  the  load  instruction  (L  R, , -^2(^2* ^2  ^ ^as  311 
average  execution  time  of  1.  20  (is  (microseconds)  [34]  , 

Stepping  from  one  field  to  another  in  a  stack  involves  incrementing 
or  decrementing  the  value  of  the  position  indicator  by  the  size  of  the 
blocks  in  the  stack.  Assuming  that  one  of  the  general  registers  con¬ 
tains  this  size,  the  operation  can  be  implemented  via  an  RR  type  add 
or  subtract  instruction  (AR  Rl,  R2  or  SR  R1  R2,  respectively), 
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each  of  which  has  an  average  execution  time  of  0. 65  jus. 

The  comparison  operations  characterized  by  c^  and  cr  involve 
comparing  the  contents  of  the  field  to  which  the  position  indicator 
points  with  some  given  quantity  and  selecting  the  next  operation  to  be 
performed  on  the  basis  of  the  outcome  of  this  comparison.  If  we 
assume  the  given  quantity  is  contained  a  general  register,  we  can  use 
a  compare  instruction  (C  R1,D2(X2,B2))  followed  by  a  branch-on- 
condition  instruction  (BC  M1,D2(X2,B2))  to  implement  the  operation. 
The  compare  has  an  average  execution  time  of  1. 40  /is  and  the  branch 
on  condition  has  an  average  execution  time  between  0.  80  /its  and  1.10 
118,  depending  upon  the  frequency  with  which  the  branch  is  successful. 

The  "fetch"  operations  characterized  by  and  vr  are  used  to 
move  the  contents  of  the  field  to  which  the  position  indicator  points  to 
some  other  memory  location.  If  one  of  the  general  registers  contains 
the  address  of  the  location  to  whicr.  the  given  quantity  is  to  be  trans¬ 
ferred,  we  can  implement  the  operation  by  using  a  load  instruction 
(L  R1,D2{X2,B2))  followed  by  a  store  instruction  (ST  R1,D2(X2,B2)), 
which  have  average  execution  times  of  1.  20  /.is  and  0.93  fis,  respec- 
ively. 

Clearly,  there  is  a  certain  amount  of  overhead  associated  with 
each  of  these  operations  which  we  have  net  considered  -  such  as,  the 
amount  of  time  involved  in  program  "looping"  when  an  operation 
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or  a  sequence  of  operations  is  to  be  performed  recursively.  Our 
analysis  should,  however,  give  us  a  good  estimate  of  he  relative 
times  required  to  perform  each  of  these  operations. 

If  we  normalize  and  round-off  the  time  quantities  indicated  above, 
we  find  that  a  pointer  operation  requires  4  time  units,  a  stack  operation 
requires  2  time  units,  a  compare  requires  8  time  units,  and  a  fetch 
requires  7  time  units.  Thus, 


{i* 


=fi<rhi= 


=hio=fa=ywvwv 


s  =  s,  =  •  •  • 
0  1 


=  2 
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Consider  now  those  parameters  which  pertain  to  storage.  Since 
an  address  on  the  IBM  360  is  24  bits  in  length,  we  could  assume  a 
pointer  field  to  be  3  bytes  in  length.  For  program  efficiency,  hrwever, 
we  should  require  the  right-hand  end  of  this  field  to  be  aligned  on  a 
full  word  boundary.  This  is  best  accomplished  by  using  a  4-byte  (or 
full  word)  field.  Therefore,  we  will  assume  pointer  fields  to  be  4  bytes 
in  lengtn.  The  left  most  (unused)  byte  can  be  used  for  type  information. 
Thus , 
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st=0 


By  assuming  that  the  sou  rce  and  target  ring  heads  consist  simply 
of  a  pointer,  we  also  have 


Since  20  of  the  22  relations  of  interest  in  our  diagnostic  problem 
are  probabilities,  we  will  assume  that  the  relation  symbol  name  fields 
of  our  structure  are  large  enough  to  contain  the  actual  values  of  the 
probabilities  to  which  these  relations  correspond.  In  particular,  this 
must  be  4  bytes.  Therefore, 


We  will  assume  that  the  relation  ring  heads  consist  of  a  relation 
symbol  name  field  and  pointer,  Therefore, 


The  description  block  corresponding  to  a  symptom  need  contain 
only  the  name  or  a  code  representing  that  symptom.  For  this,  a 
description  block  consisting  of  4  bytes  is  probably  sufficient.  On  the 
other  hand,  since  each  test  has  a  cost  associated  with  it  and  each 
disease  has  a  probability,  the  description  blocks  corresponding  to  them 
should  be  on  the  oraer  of  3  bytes,.  Let  us  assume,  therefore,  that  the 
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average  description  block  consists  of  6  bytes.  Thus, 


Finally,  since  the  IBM  360/67  is  a  virtual  memory  machine,  we 
can  assume  that  there  is  no  limit  (within  reason,  or  course)  to  the 
amount  of  memory  available  for  our  structure.  This  means  that 

S  =  oo 
o 

We  now  have  a  complete  description  of  the  environment  within 
which  our  optimal  storage  structure  must  reside. 

6.1.6  The  Solution 

We  are  now  ready  to  proceed  with  determination  of  the  optimal 
storage  structure. 

In  oui  search  for  the  optimal  solution  we  partition  (rather  arbi¬ 
trarily)  the  set  of  feasible  solutions  into  a  number  of  families  of  solu¬ 
tions,  one  for  each  distinct  combination  of  values  assigned  to  0^°  •  °  0^. 
Each  of  these  so-called  0-families  contain  all  the  feasible  solutions 
which  have  a  particular  combination  of  values  for  0j*  •  •  0^. 

In  addition  to  determining  the  overall  optimal  solution  (or  solutions), 
we  also  determine  the  best  solution  (or  solutions)  within  each  of  the 
0-families  and  this  information  is  recorded  along  with  the  overall 
optimum.  Appendix  E  contains  a  listing  of  these  results  for  the  pro¬ 
blem  currently  under  consideration,  which  we  will  designate  Case  1. 
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Upon  inspection  or  this  information  we  find  that  there  are  six 
equally  good  solutions  for  our  problem,  four  of  them  in  0-family 
number  10  and  two  in  0-family  number  28.  [  The  number  assigned 
to  a  0-family  simply  indicates  its  position  in  the  sequence  of  0-families 
as  they  are  generated  and  is  used  to  facilitate  referring  to  individual 
ones.  ]  Table  6-9  contains  more  complete  descriptions  of  these  solu¬ 
tions. 

Let  us  examine  each  of  the  .solutions  in  turn,  starting  with  Solution 
1  of  0-family  28.  Given  the  values  for  the  decision  variables  and  the 
values  for  k^*  •  we  can  generate  the  schematic  representation  of 
Figure  6-2. 

In  this  schematic  each  type  of  block  which  has  not  been  eliminated 
from  the  structure  is  represented  by  a  single  field  regardless  of  tie 
actual  number  of  fields  which  that  block  may  contain.  The  double- 
walled  field  at  tb  j  top  of  the  structure  represents  a  description  block. 
The  solid  arrows  emanating  from  the  b^-blocks  and  the  b^-blocks 
correspond  to  the  pointers  of  the  a^-rings,  and  the  dashed  arrows 
originating  from  the  b^ -blocks  represent  head  pointers.  The  arrows 
originating  from  the  b^-blocks  correspond  to  a^-ring  head  pointers 
and,  of  course,  the  arrows  from  the  bjj -blocks  represent  description 
block  indicators.  The  source,  relation,  and  target  rings  -  although 
present  -  are  not  shown. 
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0-  family:  10 

Solution  1 

01‘“'01O:  0  0  0  01  0  0  0  0  0 

0i —  0io:  ooooiioooo 

*l”AlO:  1  0  1  1  0  0  1  1  1  1 

02*  •  •  %o  :  0  0  1  0  0  0  1  1  1 

Vk10:  13  1116  1111 

Solution  2 

0i°"0io:  ooooiooooo 

0i —  0io:  ooooiooioo 

A1():  10  110  110  11 
^2*  •  *  01O-  0  0  1  0  0  1  0  0  1 

kl-”k10:  13  111116  11 


Table  6-9.  Case  1  Optimal  Solutions 
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Solution  3 


V**01O:  0000100000 

0i‘ " 0iO:  0000110000 

A1*’'A10:  1  1  1  0  0  0  1  1  1  1 
02*  •  •  01O:  1  1  0  0  0  0  1  1  1 


Solution  4 

01-*‘01O:  0000100000 
0^‘*0*o:  0000100100 

A1°*‘A10:  1110011011 
i32*  ‘  *  /310:  1  1  0  0  0  1  0  0  1 

kr-kio:  1 1 1 3 1 1 1 6 1 1 


Table  6-9.  Case  1  Optimal  Solutions  (Cont. ) 
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0-family:  28 
Solution  1 

0i —  0iO:  0010000000 

0i° ' 0 0iO:  0010010000 

Aj*  •  *  A1q:  10  0  110  1111 

j32*  *  *  %0:  0  0  0  1  0  0  1  1  1 

kj...k10:  13  1116  1111 

Solution  2 

•  •  01Q:  0010000000 

0j,  *  •  *  0jo*  0010000100 

Al“°'A10:  1001111011 

P2°  •  •  J310:  0  0  0  1  0  1  0  0  1 

ki--k10:  13  111118  11 


Table  6-9.  Case  1  Optimal  Solutions  (Cont. ) 
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Items  Common  to  All  Six  Solutions 


m  =  1  m  =  18  m i  =  1  m  =  1 

a  P  ri  r2 


Operation  Method  Cost 

A  A 


Q3:  Vf 

l(d  r  p) 

A  A 

t„  1  =116,00 

di-Pk 

1  (d  r  p) 

A 

t?  x  =  285.  22 

«12:di*- 

l(dP) 

A 

4 2  i=272'00 

-*Pk 

l(Pd) 

t.„  .=530.00 
14,i 

T  =  252.96 

S  =  33398 

Table  6-9,  Case  1  Optimal  Solutions  (Cont.) 
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Since  k^  =  1,  the  a^-ring  head  pointer  might  appear  to  be  super¬ 
fluous  -  the  forward  pointer  from  the  b^ -block  to  the  b^-block  -  are 
to  perform  the  same  function  -  but  in  goneral  this  is  not  the  cast  We 
earlier  assumed  that  a  head  pointer  points  to  the  next  item  of  rect 
interest  as  we  sequence  through  the  structure.  In  this  case  the  next 
item  of  interest  when  following  the  he\d  pointer  or  moving  in.  hat 

direction  is  the  •>,, -block  (and  its  relation  symbol  name  fiel  ’’ .  On 
o 

the  other  hand  the  a^-ring  forward  pointer  simply  points  to  the  next 
forward  pointer  of  the  ring;  in  order  to  access  the  corresponding 
bg-block  we  must  step  from  this  pointer  field  to  the  bg-block  at  a 
cost  of  s  time  units.  It  should  be  clear  then  that  the  a„-ring  head 

0  o 

pointer  performs  a  function  distinct  from  the  forward  pointer  which 
parallels  it. 

If  we  assume,  however,  that  the  environment  in  which  the  structure 
is  implemented  is  a  computer  utilizing  the  base -displacement  form  of 
addressing  (ala  the  IBM  360),  then  the  forward  pointer  can  perform  the 
same  function  as  the  head  pointer  (in  addition  to  its  original  function), 
in  which  case  the  head  pointer  may  be  omitted.  (This  would  not  be 
true,  of  course,  if  k^  were  greater  than  1.) 

If  desired,  we  could  include  in  our  procedure  the  facility  for 
eliminating  from  those  rings  <1  the  storage  structure  for  which  k^=  1 
any  head  pointers  which  might  otherwise  arise.  We  do  not  ch<  to 
do  so  here  and,  hence,  wul  maintain  the  structure  as  given. 
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Replacing  each  block  by  the  fields  it  contains  yields  the  actual 
storage  structure  which  appears  in  Figure  6-3  sans  source,  relation, 
and  target  ring  pointers. 

Since  the  methods  used  to  implement  operations  Q„,  Q_,  and  Q1 

Of  1  a 

use  the  source  ring  as  starting  points  and  the  method  used  to  imple¬ 
ment  operation  uses  the  target  rings  (none  of  the  operations  uses 

the  relation  rings)  and  since  the  b^-blocks  contain  a  relation  symbol 

0 

name  field  (hence,  the  relation  rings  need  never  be  used  to  determine 
the  relation  represented  by  a  given  block) ,  the  relation  rings  are  not 
really  needed  in  the  structure.  We  may  assume,  therefore,  that  they 
are  not  present. 

Furthermore,  since  ma=  1,  it  is  not  necessary  for  the  source 
rings  to  be  closed  loops.  That  is,  a  b^-Mock  need  not  contain  a 
pointer  back  to  the  head  of  the  corresponding  source  ring. 

Again,  we  could  include  in  our  procedure  the  steps  necessary 
to  automatically  account  for  these  contingencies.  We  did 
not  feel  that  the  effort  required  to  do  so  for  this  and  other  such  cases 
was  warranted  for  a  prototype  program,  however. 

Thus,  the  oniy  addition  required  to  make  Figure  6-3  more 
accurately  reflect  the  actual  storage  structure  is  the  inclusion  of  a 
target  ring  pointer  for  every  description  block  indicator.  (We  leave 
this  to  the  imagination  of  the  reader.) 
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Taking  omission  of  the  relation  rings  and  the  indicated  source  ring 
pointers  into  account  results  in  the  reduction  of  the  storage  cost  of  the 
storage  structure  to  31414  units. 

Now  let  us  examine  Solution  2  of  0-family  28.  This  solution  has 
the  schematic  representation  of  Figure  6-4.  We  note  that  this  schematic 
is  the  same  as  that  of  Figure  6-2  with  the  exception  that  each  bg-block 
of  Figure  6-2  is  replaced  by  a  bg-blork  -  bg-block  pair  and  each  b^- 
block  is  replaced  by  a  b^-block. 

As  a  result,  the  physical  structure  of  this  solution  is  identical  to 
that  of  the  first  solution  considered. 

In  fact,  this  is  true  for  the  four  solutions  of  0-family  10,  also! 

Thus,  although  each  of  the  six  given  solutions  has  a  unique  speci¬ 
fication  ,  they  all  result  in  the  same  physical  storage  structure. 

6.1.7  Justification  of  the  Solution 

We  will  now  present  an  intuitive  justification  for  the  solution  we 
have  obtained.  Because  of  their  relative  weignts,  operations  Qg  and 
(especially  operation  would  be  expected  to  influence  the  form 
of  the  structure  considerably  more  than  operations  a^d  Q^. 

Of  the  three  alternative  methods  available  for  operation  Q,,  - 

A  A  A  A  A  A 

namely,  drp,  r  d  p,  and  p  r  d  -  we  would  expect  either  of  the  first 
two  to  be  preferable  to  the  third  on  the  grounds  that  fewer  possible 


solutions  must  be  considered  when  using  one  of  them  than  when  using 
the  third.  By  the  same  token,  since  the  number  of  distinct  relations 
which  apply  to  a  given  source  is  less  than  the  number  of  sources 
associated  with  a  given  relation,  we  would  expect  the  first  method  to 
be  preferable  to  the  second. 

Similarly,  of  the  four  methods  available  for  operation  - 

A  A  A  A  A  A  AA 

d  r  p,  p  r  d,  r  d  p,  and  r  p  d  -  we  would  expect  the  first  two  to  be 
preferable  to  the  last  two.  If  operation  were  considered  by  itself, 
either  of  the  first  two  methods  would  probably  serve  equally  well. 
However,  since  our  choice  of  methods  for  operation  favors 
starting  with  sources,  we  are  encouraged  to  choose  the  first  method 
for  operation  Q^. 

in  light  of  our  choices  of  methods  for  operation  and  Q^,  we 

A  a 

would  certainly  prefer  method  d  p  to  method  p  d  for  operation  Q^. 

In  view  of  our  choices  so  far  we  might  be  tempted  to  choose  method 
a  a 

d  p  over  method  p  d  for  operation  Q^,  but  the  latter  is  probably  pre¬ 
ferable  in  all  but  the  most  extreme  cases  and  so  we  choose  it.  In  any 
event  since  the  relative  weight  cf  operation  is  so  small,  our 
decision  here  probably  will  not  have  a  very  great  effect  upon  the  form 
of  the  storage  structure  anyway.  (We  note  that  it  is  responsible  for 
inclusion  of  the  ag-ring  head  pointers,  however.) 

Elution  of  our  problem  does,  of  course,  support  our  intuitive 
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choices  of  methods. 

Let  us  examine  the  structure  of  Figure  6-3  given  our  choice  of 
methods  for  the  various  operations.  In  order  for  operation  to  be 
as  efficient  as  possible,  the  relations  which  apply  to  a  given  source 
should  be  easily  accessible  from  that  source.  At  the  same  time  to 
promote  efficiency  for  operation  Q^,  the  targets  (as  well  as  the 
relations)  associated  with  a  given  source  should  be  easily  accessible. 

Intuitively  one  might  feel  that  duplication  of  the  b^-blocks  upon 
the  bg-blocks  should  render  operation  Q?  somewhat  more  efficient 
than  the  rings  of  our  given  storage  structure.  In  this  case  access  to 
all  targets  could  be  made  by  stepping  through  a  single  stack  In 
the  context  of  our  current  problem  there  are  two  pitfalls  to  this 
approach:  First,  the  performance  of  operation  suffers  in  this 
case  because  we  must  step  through  a  stack  of  description  block  indi¬ 
cators  in  order  to  access  one  relation  symbol  name  field  from  another. 
Second,  for  operation  itself  we  must  step  all  the  way  through  a 
stack  of  description  block  indicators  (to  reach  the  next  relation  symbol) 
even  though  the  target  of  interest  may  already  have  been  found.  With 
the  structure  of  Figure  6-3  this  is  not  necessary.  Thus,  the  rings  of 
our  structure  would  appear  to  be  the  logical  chuice. 

Once  a  given  relation  symbol  has  been  found,  a  stack  of  description 
block  indicators  (as  contained  in  the  given  structure)  would  appear  to  be 
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the  best  choice  for  both  operations  Q„  and  Q  . 

u  i 

Because  of  the  similarlity  between  operation  Q1  and  operations 

^2  and  and  because  of  the  small  relative  weight  of  operation  Q^, 
the  given  structure  should  render  it  relatively  efficient,  also. 

In  spite  of  the  fact  that  there  are  on  the  average  18  b^ -blocks  for 
each  target  in  our  structure,  operation  Q14  is  still  relatively  effi¬ 
cient.  The  head  pointers  from  the  b--blocks  to  the  b„ -blocks  and  the 

I  0 

explicit  a^ -rings  both  contribute  to  making  access  of  the  sources 
associated  with  a  given  target  very  fast.  We  note  that  duplication  of 
the  bA-blocks  upon  the  bg  -blocks  would  be  a  detriment  to  this  opera¬ 
tion,  also. 

Thus,  the  storage  structure  of  Figure  6-3  is  appealing  even  on 
an  intuitive  bas  .s. 

6.1.8  Comparison  with  an  Existing  Storage  Structure 

In  order  to  put  the  solution  we  have  obtained  into  proper  perspective 
let  us  compare  it  with  the  scheme  used  by  Gorrv  in  his  implementation 
of  the  diagnostic  system. 

Gorry's  storage  structure  consists  of  a  number  of  SLIP  lists  which 
are  maintained  and  interrogated  through  use  of  the  SLIP  language. 
Although  he  concedes  that  this  structure  may  at  times  result  in 
inefficient  utilization  of  the  computer  memory,  he  maintains  that  this 
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disadvantage  was  more  than  offset  by  the  convenience  of  programming 
in  the  high-level  SLIP  language.  Since  he  was  concerned  with  imple¬ 
menting  a  prototype  system,  we  must  agree  wholeheartedly  with  this 
philosophy. 

In  his  structure  Gorry  defines  a  basic  information  block  (our  data 
item)  to  be  either  a  state  (i.e. ,  a  disease),  an  attribute  (i.e. ,  a 
symptom),  or  a  test.  Each  of  these  basic  blocks  is  then  represented 
by  a  SLIP  list.  The  result  is  a  number  of  state  lists,  attribute  lists, 
and  test  lists . 

A  state  list,  as  shown  in  Figure  6-5,  consists  of  a  ring  of  attri¬ 
bute  list  names  (i.e. ,  pointers  to  the  heads  of  certain  attribute  lists), 
one  for  every  attribute  relevant  to  the  state,  and  a  description  list  (or 
DLIST)  which  contains  the  a  priori  probability  of  the  given  state  and 
the  print  name  of  the  state , 

An  attribute  list,  as  shown  in  Figure  6-6,  consists  of  a  ring  of 
test  list  names  corresponding  to  those  tests  which  can  result  in  the 
given  attribute  and  a  DLIST  which  contains  the  print  name  of  the 
attribute  and  a  pointer  to  a  member  list.  The  member  list  contains 
the  list  name  of  each  state  list  on  which  the  name  of  the  attribute  list 
appears  and  the  corresponding  probability  of  the  attribute  given  the 
state. 

Finally,  a  test  list,  which  appears  in  Figure  6-7,  contains  the  cost 
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Figure  6-5.  State  List 


080 


6-7.  Test  Lie 


of  the  test  and  a  DLIST.  The  DLIST  contains  the  print  name  of  the 
test,  and  a  member  list  for  the  attribute  lists  which  include  this  test. 

For  the  heart  disease  problem  which  we  have  been  considering 
each  state  iis ,  will  contain  an  average  of  28. 4  attribute  list  names; 
each  attribute  list  will  contain  1  test  list  name  and  its  member  list 
will  contain  an  average  of  18. 7  pairs  of  state  list  names  and  prob¬ 
abilities;  and  each  test  list  will  have  an  average  of  1. 7  attribute  list 
names  in  its  member  list. 

Using  this  information  and  the  description  of  the  environment 
used  in  determining  our  optimal  storage  structure,  let  us  determine 
the  cost  of  each  of  the  basic  operations  involved  in  the  diagnostic  pro¬ 
cess.  To  reiterate  briefly,  the  basic  operations  are:.  (1)  determine 
the  diseases  associated  with  a  given  symptom,  (2)  determine  the 
symptoms  associated  with  a  given  disease,  (3)  determine  the  condi¬ 
tional  probability  that  a  particular  symptom  is  associated  with  a  given 
disease,  (4)  determine  the  tests  relevant  to  a  given  disease,  and 
(5)  determine  the  possible  outcomes  of  a  given  test, 

The  fii  st  of  these  operations  involves  finding  the  attribute  list 
corresponding  to  the  given  symptom  and  recording  the  list  name  of 
each  state  list  on  its  member  list.  (The  print  name  of  each  disease 
is  irrelevant  at  this  point  since  the  results  of  this  operation  are  simply 
used  as  the  starting  points  for  the  second  operation.)  Finding  the  head 
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of  the  attribute  list  requires  T  +F  units  of  time  .  Accessing  the 
head  of  the  corresponding  member  list  involves  moving  to  the  DLIST 
at  a  cost  of  sQ  +  f  time  units  (where  f  represents  the  time  cost  to 
follow  a  pointer)  ,  stepping  through  the  DLIST  at  a  cost  of  4f  time  units 
(ignoring  the  cost  of  any  comparisons  which  must  be  made) ,  and 
finally  moving  to  the  head  of  the  member  list  at  a  cost  of  sq+  f  time 
units.  Recording  the  list  name  of  each  of  the  state  lists  then  involves 
stepping  through  the  member  list  at  a  cost  of  18. 7  (2f)  and  fetching 
each  of  the  state  list  names  at  a  cost  of  3  plus  the  actual  cost  of  a 
fetch  (which  we  will  represent  by  v  and  assume  requires  7  time 
units).  Summing  these  incremental  costs  yields  a  time  cost  of  353. 9 
for  this  operation. 

The  second  operation  starts  at  the  head  of  a  state  list  and  records 
the  print  name  of  each  attribute  on  the  list.  The  time  cost  for  this 
operation  is  given  by  the  expression 

28.  4  (f  •*  s  +  4f  +  s  +  v) 
o  o 

where  f  time  units  are  required  to  access  an  element  of  the  state  list 
(from  the  previous  one),  sq  units  are  required  to  move  to  the  corre¬ 
sponding  attribute  list  name,  4f  units  are  required  to  access  the  print 
name  block  on  the  DLIST  of  the  given  attribute  list,  and  sq  +  v  units 
are  required  to  fetch  the  print  name  .  As  a  result  this  operation  has  a 
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time  cost  of  860. 4  time  units. 

The  third  operation  -  determining  t»*e  conditional  probability  erf 
a  symptom  given  a  disease  -  involves  finding  the  attribute  list  corre¬ 
sponding  to  the  given  symptom,  searching  the  associated  member  list 
for  the  given  disease,  and  recording  the  corresponding  probability. 

To  the  point  of  accessing  the  member  list  this  operation  is  identical  to 
the  first  operation  and,  hence,  requires 

T  +  +  s  +  s,  +  s  +  f 

a  a  o  f  o 

time  units  to  reach  this  point  in  the  structure.  On  the  average  we 
would  expect  to  examine  (18. 7  +  l)/2  state  list  entries  of  the  member 
list  before  finding  the  given  disease.  For  each  oie  examined  we  must 
access  the  member  list  element  block  (from  the  previous  one)  at  a 
cost  of  f  time  units,  access  the  head  of  the  corresponding  state  list  at 
a  cost  of  so  +  f,  then  access  and  compare  the  print  name  at  a  cost  of 
5f  +  sQ  +  c.  (where  c  represents  the  cost  of  a  comparison  and  has  the 
value  8),  and  finally  move  to  the  member  list  element  block  containing 
the  value  of  the  probability.  For  the  last  state  list  considered  we  must 
fetch  the  value  of  the  probability,  which  costs  +  v  time  units.  The 
resultant  time  cost  for  this  operation  is  437,0  time  units. 

To  perform  the  fourth  operation  -  determination  of  the  tests 
relevant  to  a  giver;  disease  -  we  must  access  the  state  list  representing 
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the  disease  and  access  in  tc  m  the  associated  attribute  lists,  which 
contain  the  list  names  of  the  pertinent  tests.  This  requires  T  +  F0 

cL  a. 

time  units  to  access  the  head  of  the  state  list,  f  -+•  +  f  units  to 

access  each  of  the  28. 4  attribute  lists,  f  +  sQ  +  f  units  to  access  the 
corresponding  test  list,  and  3f  +  kq  +  v  time  units  to  fetch  the  print 
name  of  the  test.  The  time  cost  for  the  operation  is  therefore  1172. 4 
time  units. 

Finally,,  the  fifth  operation  -  determining  the  possible  outcomes 
of  a  test  -  involves  fetching  the  print  names  of  the  attribute  associated 
with  the  given  tes:  list.  To  do  this  we  must  access  the  member  list 
for  the  given  test  at  a  cost  of  Ta  +  Fft  +  5f  +  so  +  f  and  access  the 
print  name  of  each  of  the  1. 7  attribute  lists  thereon  at  a  cost  of 
1. 7  (f  +  sQ  +  4f  +  sQ  +  v).  The  resultant  time  cost  is  86. 7  tine  units. 

In  crder  to  make  a  fairly  coarse  comparison  of  the  SLIP  structure 
with  the  optimal  structure,  let  us  compare  these  time  sosts  with  those 
determined  for  the  primitive  operations.  Operations  1,2,  and  3 
correspond  to  primitive  operations  Qjg,  Q^,  and  Q^,  respectively, 
and  operations  4  and  5  correspond  to  primitive  operation  Qg.  Table 
6-10  contains  a  summary  of  the  time  costs  !ci’  both  sets  of  operations, 
where  the  costs  of  operations  4  and  5  have  been  weighted  according  to 
their  relative  frequencies  (operation  5  is  used  five  times  as  much  as 
operation  4)  and  combined  to  facilitate  comparison  with  the  time  cost 
for  primitive  operation  Q,.  The  corresponding  values  of  the  weighted 
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SLIP  Structure 


Optimal  Structure 


Relative 


Operation 

Cost 

Frequency 

Operation 

Cost 

1 

353.9 

0.001 

®12 

272.00 

2 

880.4 

0.001 

«14 

530.00 

3 

437.0 

0.806 

*7 

285.22 

4  &  5 

267.6 

0.192 

JO 

GO 

116.00 

T  = 

404.8 

T  =  252.96 

Table  6-10.  Operation  Time  Costs  for  the 

SLIP  Structure  versus  the  Optimal 
Structure 
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time  cost  function  also  appear  in  Table  6-10.  We  see  that  the  time 
cost  T  for  the  SLIP  structure  is  60  percent  greater  than  that  for  the 
optimal  structure. 

In  order  to  achieve  a  somewhat  better  comparison  of  the  two 
structures  let  us  determine  the  cost  of  each  of  the  five  basic  operations 
using  the  optimal  storage  structure. 

There  are  two  reasons  why  computing  the  costs  of  the  five  basic 
operations  for  the  optimal  structure  (instead  of  using  the  costs 
determined  for  the  primitive  operations  in  obtaining  the  optimal 
structure)  should  give  us  a  better  basis  for  comparison. 

First,  in  certain  cases  our  primitive  operations  are  only  approxi¬ 
mations  to  the  corresponding  basic  operations  which  they  ostensibly 
represent.  For  example,  we  note  that  primitive  operation  is  really 
more  general  than  operation  3,  its  counterpart.  In  particular,  Q? 
examines  all_  relations  associated  with  a  given  symptom  (acting  as  a 
source)  to  determine  whether  or  not  the  given  disease  is  a  possible 
target.  In  practice  we  know  that  there  is  only  one  such  relation  and 
once  it  has  been  found  all  others  may  be  disregarded.  This  mismatch 
of  operations  could  be  easily  overcome,  of  course,  by  introducing 
another  primitive  operation  which  searches  for  exactly  one  relation 
(possibly  the  first  encountered  out  of  many,  or  else  the  only  one) 
which  associates  a  given  target  with  a  given  source.  Adopting  such  a 
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posture  for  all  similar  cases  might  result  in  a  plethora  of  primitive 
operations,  however.  In  addition,  the  added  effort  required  to  derive 
time  cost  expressions  for  these  additional  primitive  operations  might 
not  be  rewarded  by  proportionate  increases  ir>  the  accuracy  of  the 
optimal  structure  obtained. 

The  second  reason  for  computing  the  basic  operation  costs  for 
the  optimal  structure  is  that  each  of  these  time  costs  may  reflect  the 
actual  (average)  numbers  of  blocks  we  may  expect  to  encounter  in  the 
structure  for  the  relations  with  which  the  corresponding  operation  is 
intimately  concerned.  The  primitive  operation  time  costs  which  we 
are  given,  on  the  other  hand,  reflect  the  average  over  all  relations. 

Let  us  proceed,  therefore,  with  osculations  of  the  time  costs  for 
the  five  basic  diagnosis  operations,  using  the  optimal  storage  structure. 

The  first  operation  -  determining  the  diseases  associated  with  a 
given  symptom  -  involves  finuing  all  the  description  blocks  for  diseases 
associated  with  a  given  symptom.  To  perform  this  operation  we  utilize 
that  portion  of  the  structure  representing  the  relations  r^  •  •  •  r°  ,  which 
have  symptoms  as  sources  and  diseases  as  targets.  We  must  find 
the  description  block  representing  the  given  symptom,  step  through  a 
stack  of  on  the  average  6.  6  -rings  (using  the  notation  of  Solution  1 
for  0-family  28),  and  for  each  of  these  step  through  a  stack  of  2.  8 
-blocks  to  access  a  total  of  18. 7  description  block  indicators. 
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Accessing  the  symptom  description  block  requires  T  +  F.  time  units, 

ci  a 

stepping  through  the  a^-rings  and  following  their  respective  head 
pointers  to  the  stacks  of  description  block  indicators  requires  6.  6 
(sQ  +  f)  time  units,  and  recording  the  various  disease  description  bio  :k 
indicators  requires  18. 7  (sQ  +  v)  time  units.  The  resultant  time  cost 
is  215.9  time  units. 

The  second  operation  -  determining  the  symptoms  associated  with 
a  given  disease  -  also  relies  upon  that  portion  of  the  storage  structure 
which  represents  relations  r°  •  •  •  rl>  For  this  operation,  however, 
we  must  access  (via  the  appropriate  target  ring)  all  b^ -blocks  repre¬ 
senting  a  given  disease  and  move  upward  through  the  structure  to  access 
the  corresponding  symptom  description  blocks.  For  this  case  mp  will 
have  an  average  value  of  28.4.  Therefore,  accessing  all  b^ -blocks 
representing  a  given  disease  will  require  T  +■  29. 4F  time  units. 

CL  A 

Since  the  stack  of  b^-blocks  contains  on  the  average  6.6  blocks,  we 

would  expect  to  step  through  (6, 6+l)/2  of  these  on  the  way  to  the  symp- 

* 

tom  description  block  .  Thus,  tor  each  b^ -block  encountered  the 
time  cost  required  to  access  the  corresponding  symptom  description 
block  will  be  s  +  f  +  s  +  f  +  3. 8s  .  The  total  time  required  for  this 
operation  will  then  be  67S.  2  time  units. 

Note  that  if  the  value  of  k  ^  had  actually  been  6.  6  instead  of  3  the 

optimal  storage  structure  would  have  contained  head  pointers  to  facili¬ 
tate  this  ope  ration. 
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The  third  operation  -  determination  of  the  conditional  probability 

that  a  symptom  is  associated  with  a  given  disease  -  is  very  similar 
in  implementation  to  the  first  operation.  In  this  case,  however,  since 

v’emay  halt  our  transversal  of  the  structure  once  we  have  found  the 
particular  disease  sought,  we  need  consider  an  average  of  only 
(6. 6+l)/2  a^- rings  and  (1ft.  7+l)/2  disease  description  blocks.  Thus, 
accessing  the  symptom  description  block  requires  TQ+  Fra  time  units, 
stepping  through  the  stack  of  a3 -rings  and  following  their  respective 
head  pointers  requires  3.  8(sq+  f)  time  units,  examining  the  various 
disease  description  blocks  requires  9.  &  (sQ+  f  +  c)  time  units,  and 
fetching  the  value  of  the  probability  relating  the  given  symptom  and 
the  given  disease  requires  v  time  units.  The  time  cost  of  the  third 
operation  is  therefore  175.0  time  units. 

Operation  4  -  determining  the  tests  relevant  to  a  given  disease  - 
uses  the  portion  of  the  storage  structure  which  represents  the  relation 
rp  which  has  diseases  as  sources  and  their  relevant  tests  as  targets. 
This  operation  involves  accessing  a  disease  description  block  at  a 
cost  of  T  +  F0  time  units,  stepping  through  a  stack  containing  a  single 
a^-ring  and  accessing  the  corresponding  stack  of  b^ -blocks  at;  a  cost 
of  sQ  +  f,  and  fetching  the  tests  indicated  by  the  test  description  block 
idicators  in  this  stack  (19. 3  of  them  on  the  average)  at  a  cost  of  19. 3 
(sQ+  f+  v).  This  results  in  a  time  cost  of  264. 9  time  units. 
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Operation  5  -  determining  the  possible  outcomes  of  a  given  test  - 
uses  that  part  of  the  structure  representing  the  relation  r ^  which  has 
tests  as  sources  and  symptoms  as  targets.  Its  implementation  is 
identical  to  that  for  the  fourth  operation,  but  since  the  stack  of  b^- 
blocks  contains  on  the  average  only  1. 7  blocks,  its  time  cost  is  36. 1 
time  units. 

Table  6-11  contains  a  summary  of  these  time  costs  and  the  overall 
time  cost  in  comparison  with  the  corresponding  quantities  for  the  SLIP 
structure.  On  the  basis  of  this  comparison  we  see  that  the  time  cost 
T  for  the  SLIP  structure  is  2. 6  times  greater  than  that  for  the  optimal 
structure. 

Let  us  also  compare  the  storage  costs  of  the  two  structures  under 
consideration.  Assuming  that  each  field  of  a  SLIP  list  block  requires 
4  bytes  of  storage,  each  block  then  requires  12  bytes.  Summing  the 
number  of  blocks  required  to  implement  a  list  over  33  state  lists, 

50  attribute  lists,  and  30  test  lists  results  in  a  total  storage  cost  of 
44355  units,  which  is  41  percent  greater  than  the  31414  units  required 
by  the  optimal  storage  structure. 

Finally,  let  us  investigate  how  the  SLIP  structure  might  be 
described  by  our  storage  structure  model,  and  then  determine  the 
time  and  storage  costs  for  the  diagnostic  problem  using  this  structure 
and  the  required  primitive  operations. 
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Operation 


Time  Cost 


SLIP  Structure 

Optimal  Structure 

1 

353.6 

215.9 

2 

880.4 

678.2 

3 

437.0 

175.0 

4 

1172.4 

264.9 

5 

86.7 

36.1 

T  =  404.8 

T  =156.2 

Table  6-11.  Basic  Operation  Time  Costs  for  the 

SLIP  Structure  Versus  the  Optimal  Structure 
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If  we  examine  Figures  6-5,  6-6,  an^  „  7  closely,  we  see  tuat  the 
state,  attribute,  and  test  lists  are  all  basically  the  structure  in  Figure 
6-8. 

In  particular,  for  the  state  list  if  we  assume  that  the  head  of  the 
list  and  the  DLIST  may  be  combined  into  the  description  block  (the 
double -walled  blcck)  of  Figure  6-8,  then  except  for  the  presence  of 
relation  symbol  name  fislds  the  remainder  of  the  structure  is  identical 
to  that  of  Figure  6-8. 

If  we  ignore  for  the  moment  the  associated  member  list,  then  the 
same  can  be  said  for  the  attribute  list.  If  we  now  combine  each  pair 
of  blocks  in  the  member  list  into  a  single  block  like  those  in  Figure 
6-8  and  if  we  assume  that  the  head  of  the  member  list  and  the  DLIST 
are  combined,  we  have  for  the  member  list  exactly  the  structure  of 
Figure  6-C.  Thus,  the  attribute  list  can  be  considered  to  consist  of 
a  single  description  block  wJLn  two  associated  rings  like  that  of  Figure 
6-8. 

If  the  test  list  itself,  its  DLIST,  and  the  head  of  its  associated 
member  list  are  combined  into  a  single  block,  again  except  for  the 
relation  symbol  name  fields  we  have  the  structure  of  Figure  6-8. 

Since  each  of  the  state  lists,  attribute  lists,  and  test  list  member 
lists  may  be  viewed  as  representing  all  the  relation  instances  having 
a  given  souice  for  a  single  particular  relation,  we  may  use  the 
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Figure  6- 8.  Basic  SLIP  Structure 
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structure  of  Figure  6-8  (including  the  relation  symbol  name  fields)  to 

represent  each  of  these  lists  and  assume  that  for  each  list  all  of  the 

relation  symbol  name  fields  contain  the  same  relation  symbol  name 

and  occupy  zero  storage  units.  For  the  attribute  list  member  list 

the  relation  symbol  name  fields  may  contain  the  various  conditional 

probabilities  in  a  situation  analogous  to  the  representation  of  our 

relations  r*^  • *  r  In  speaking  of  a  single  homogeneous  structure 

for  the  representation  of  aU_  the  lists  we  may  then  use  the  structure 

of  Figure  6-8  and  specify  the  size  of  the  relation  symbol  name  field  as 

being  the  average  of  those  for  the  various  lists  (i.e. ,  sr  =  2  bytes). 

It  should  be  clear  that  within  the  context  of  the  SSM  the  structure 

of  Figure  6-8  may  have  the  description  contained  in  Table  6rl2,  where 

we  have  ignored  the  presence  of  reverse  pointers. 

Before  determining  values  for  the  unspecified  parameters  k^,  k° , 

and  k°  let  us  determine  the  values  of  k°  ,  k°  ,  and  k° .  We  would  nor- 
9  a’  p’  r 

mally  say  that  the  number  of  sources  is  equal  to  the  number  of  states 

plus  the  nunber  of  attributes  plus  the  number  of  tests  (i.e. ,  ko  =  113). 

a 

However,  since  we  wish  to  associate  two  rings  with  each  attribute,  we 
see  that  the  attributes  have  tv/o  distinct  roles  in  the  structure  and  as  a 
result  we  will  count  each  one  twice  so  that  k°  =  163,  (We  might  have 
two  source  rings  associated  with  each  attribute,  but  only  a  single 
description  block.)  We  will  count  each  attribute  as  a  target  only  once 
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0001000000 


^i0 '  °  01O:  oooooooooo 

1110111111 
02*  •  •  ^10:  1  1  0  0  0  1  1  1  1 


V0  a2=1 

Pl=  0  p2=1  p3=0 


ko  -  1 

k2=1 
k  3  =  1 

k°4  =  ? 


k°8=? 
k°9  =  ? 

“io-1 


A  question  mark  indicates  an 
unspecified  value  greater  than 
or  equal  to  1 . 


Table  6-1 2c  Specifications  for  the  Basic  SLIP  Structure 
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since  there  is  no  dichotomy  of  roles  here,  so  k°  =  113.  Finally, 

P 

k°  =23,  twenty  relations  corresponding  to  the  distinct  probabilities 
and  one  each  for  the  state  list,  attribute  list,  and  test  member  list 
functions. 

Let  us  now  consider  kc^,  k  °,  and  kg.  k^  corresponds  to  the 
average  number  of  relation  symbols  for  each  of  the  state  lists,  attri¬ 
bute  lists,  attribute  member  lists,  and  test  member  lists.  We  can 
easily  determine,  therefore,  that  k°  =  2.7. 

k°  represents  the  average  number  of  targets  for  each  source- 

O 

relation  symbol  pair  and,  hence,  will  have  the  value  k°  =  4.  4 
Since  the  expression 


k°k0k°k0  k°  -k°k°kC^k0k0 
K  2K  4K  8  10  a  i  3K  7*  9  p 


must  be  satisfied,  it  follows  that 


O 

P 


which  yields  the  result  k°  =  17. 1. 

Because  of  the  constraint  imposed  by  our  program  requiring 
these  parameters  to  be  integer  valued,  let  us  assign  the  following 
values  to  them* 


k°  =150 
a 


k°  =100  k°  =  23 

P  r 


k«-4  k°g  .  18 


:0  b 


k°,  *  3 
4 


Finally,  let  us  assume  m  =  4.  (This  should  be  approximately  the 

rp 

same  as  the  value  derived  for  m  in  specifying  the  values  of  para- 

rP 

meter*  for  our  derivation  of  the  optimal  structure  and  must  be  integer 
valued,  also,  so  we  choose  the  value  4.) 

As  a  last  step  before  computing  the  measures  of  performance  for 
this  storage  structure  we  must  determine  the  primitive  operations  to 
be  used  and  their  relative  frequencies.  Basic  operation  1  corresponds 
to  applying  primitive  operation  Q1  0  to  the  member  list  associated  with 
a  given  attribute.  Operation  2  may  be  accomplished  by  applying  pri¬ 
mitive  operation  to  the  proper  state  list.  Operation  3  may  be  re¬ 
resented  by  primitive  operation  Q7  applied  to  an  attribute  member  list 
Operation  4  involves  applying  primitive  operation  0>3  recursively  -  first 
to  a  given  state  list  and  then  to  each  of  the  attribute  lists  indicated  by 
the  first  application.  Lastly,  operation  5  corresponds  to  primitive 
operation  Q  as  applied  to  the  proper  test  membei  list.  It  follows  that 

O 

the  rt'ative  frequencies  of  primitive  operations  Q^,  Q^,  and  Q12  are  as 


given  below: 

Operation 

Frequency 

Relative  Frequency 

<tydirf) 

35095 

0.570 

Q7(di-Pk) 

26396 

0.429 

Q1 2(di*  ') 

13 

0.001 

61504 
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If  we  now  use  our  program  to  compute  the  time  and  storage  costs 
for  the  basic  SLIP  structure  under  consideration  (where  the  environ  ¬ 
ment  is  assumed  the  same  as  that  used  in  determining  our  optimal 
structure),  we  find  that 


T  =248.00 
S  =  35684 

At  first  glance  it  might  appear  as  though  this  structure  is  superior 
to  our  so-called  optimal  structure!  Indeed,  it  is  true  that  the  average 
time  cost  for  the  performance  of  a  single  operation  (which  is  essentially 
what  the  time  cost  function  T  represents)  for  this  structure  is  less  than 
that  for  our  optimal  structure  (comparing  the  two  computer  determined 
time  costs).  It  is  also  true,  however,  that  more  operations  must  be 
performed  for  this  structure  than  for  the  optimal  structure  in  order  to 
achieve  the  same  result.  In  particular  we  see  from  our  calculations 
of  the  relative  frequencies  of  the  primitive  operations  applied  to  each 
of  the  storage  structures  that  whereas  the  optimal  structure  requires 
the  performance  of  32734  operations  for  the  ’’solution"  of  the  diagnostic 
problem,  the  basic  SLIP  structure  requires  the  performance  of  61504 
operations.  If  we  normalize  the  time  cost  T  for  the  SLIP  structure  to 
32734  operations  (i.e. ,  if  we  multiply  this  time  cost  by  the  ratio 
81504/32734)  we  find  that  the  equivalent  time  cost  is  466.  24.  Thus, 
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the  time  cost  for  the  basic  SLIP  structure  of  Figure  6-8  is  really  1.  8 
times  that  of  the  optimal  storage  structure  when  compared  with  the 
program  determined  time  cost  (252, 96)  for  the  optimal  structure. 

To  complete  this  comparison  we  might  correct  the  storage  cost 
given  above  to  account  for  the  additional  attribute  description  blocks 
and  the  missing  reverse  pointers.  To  wit,  S  =  43184  storage  units, 
which  is  37  percent  greater  than  that  required  by  the  optimal  structure. 

We  might  aiso  find  it  instructive  to  compare  the  time  and  storage 
costs  for  the  basic  SLIP  structure  with  those  for  Gorry's  SLIP 
structure.  There  should  be  a  high  correlation  between  the  values  ob¬ 
tained  for  the  two  structures. 

If  we  compare  the  normalized  time  cost  for  the  basic  SLIP 
structure  as  we  have  computed  it  here  (466.  24)  with  the  time  cost  we 
have  determined  for  Gorry's  SLIP  structure  (404. 8),  we  see  that  the 
time  cost  for  the  basic  SLIP  structure  is  15  percent  greater  than 
that  for  Gorry's  structure.  This  difference  can  probably  be  attributed 
to  the  fact  that  primitive  operation  is  more  general  than  its  counter¬ 
part,  operation  3.  (We  discussed  this  matter  somewhat  earlier.) 

If  we  now  compare  the  storage  cost  for  the  basic  SLIP  structure 
(43184)  with  the  for  Gorry's  SLIP  structure  (44355),  we  find  th2t,  the 
basic  structure  has  a  storage  cost  within  3  percent  of  that  for  Gorry's 
structure. 
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These  results  would  tend  to  imply  that  even  though  our  model 
uses  average  values  to  characterize  a  particular  data  structure 
and  its  associated  storage  structure,  the  model  may  be  used  to  deter¬ 
mine  a  good  estimate  of  the  time  and  storage  costs  for  existing  or 
proposed  structures,  in  addition  to  determining  the  optimal  storage 
structure  for  a  given  application . 

The  conclusion  which  we  wish  to  draw  from  the  foregoing  com¬ 
parisons  is  quite  simple:  Gorry's  SLIP  storage  structure,  although 
presumably  designed  to  be  relatively  efficient  in  operation,  can  be 
significantly  surpassed  in  performance,  Ferhaps  we  do  Gorry  an 
injustice  by  making  this  statement,  for  he  does  admit  to  possible 
ineflicient  utilization  of  memory  (clearly,  for  this  application  the 
reverse  pointers  of  the  SLIP  structure  are  not  required).  In  any 
event,  his  structure  gives  us  a  reference  point  for  determining  the 
relative  improvement  in  performance  we  might  expect  by  using  a 
systematic  procedure  and  a  rigorous  framework  in  the  design  of  data 
representation  schemes. 

6.  2  Near  -  Optimum  Solutions 

Our  next  goal  is  to  examine  the  relative  sharpness  of  the  valley 
in  which  the  optimal  solution  resides  in  order  to  determine  just  how 
critical  ii  is  that  we  find  the  optimal  solution*  In  other  words  we 
want  to  determine  whether  or  not  there  are  other  solutions  which  in 


practice  corV  serve  as  well  as  the  optimum. 

Figure  6-9  contains  a  plot  of  the  time  costs  of  the  best  solution 
for  various  0-i'i.milies  .  We  see  from  this  plot  that  there  are  a  number 
of  solutions  which  have  time  costs  close  to  that  of  the  optimal  solution 
(0-families  JO  and  28).  In  particular,  O-families  4,  19,  37,  46,  and 
55  all  contain  solutions  which  have  time  costs  within  3  percent  of  the 
t-me  cost  for  the  optimal  solution*.  This  would  tend  to  confirm  the 
hypothesis  that  there  is  no  absolute  optimal  solution,  but  rather  a 
spectrum  of  optimal  or  nearly  optimal  solutions. 

In  order  to  obtain  some  feel  for  the  "variance"  we  can  expect  from 
one  solution  to  another  in  this  spectrum  of  solutions,  let  us  examine 
briefly  the  best  solution  for  each  of  the  0-families  indicated  above. 
Descriptions  of  these  solutions  are  contained  in  Table  6-13  in  order  of 
increasing  time  cost. 

Upon  close  inspection  of  these  various  solutions  we  find  in  each 
cas('  that  the  two  solutions  in  a  0-family  result  in  the  same  physical 
storage  structure.  Moreover,  the  solutions  for  0-famil/  19  and 
0-family  55  are  identical.  Figures  6-10  through  6-13  contain 
schematics  for  the  solutions  of  0-families  19  (and  55),  4,  37,  and  46. 
respectively. 

* 

The  best  solution  for  each  of  these  ^-families  has  a  time  c<"  -  within 
3  percent  of  that  tor  the  optimal  solution.  It  is  possible,  of  course, 
that  these  0-familit.>  might  contain  other  solutions  within  this  range 
as  well. 
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0-family:  19 

Solution  1 


vv 

0001000000 

<T'-(Sio: 

0001010000 

Al"'A10: 

1110101111 

1  1  0  0  C  0  1  1  1 

kl"-k10: 

1113  16  1111 

Solution  2 

V0io: 

0001000000 

T ' '  «io: 

0001000100 

Al'”A10: 

1110111011 

(32"’|310: 

110001001 

kr,-kio: 

1113  1116  11 

Common  Items 

m=l  m  =  18  m=l  : 

a 

p  <1 

tg  118,00 

t?  j  =289,  22 

T  =  256,42 

o 

o 

o 

c- 

CM 

II 

Y—1 

CM 

-UJ 

S  =  32494 

h  JI86.00 

Table  0-13,  Near  Optimal  Solutions  for  Case  1 
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0-family:  55 
Solution  1 


01.‘.01Q:  0100000000 
0i°  * 0  0io:  01000100°0 

At « •  •  AJ():  1011101111 

001100111 

Vk10:  13  1116  1111 
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0-family:  4 
Solution  1 

01«..01():  0000001000 

0i‘  ’  *  0io:  0000001100 

A^o-Ajq:  10  11110  0  11 

^2' '  *  %0:  0  0  1  1  0  0  0  0  1 

k«  •  •  *kin:  13  111116  11 

x  1U 

Solution  2 

01  * '  *  01O:  0000  0  01000 

0i°  *  ‘  0iO:  0000001100 

Ar”Aio:  1 1 1  o  1 1  o  o  i 1 

/32°',%0:  1  1  0  0  0  0  0  0  1 
Vk10:  1113  1116  11 

Common  Items 


m  =  1  m  =  1 8 

m  =-1 

a  p 

rl 

CO 

11 

O 

o 

t?  J  =  291  „  22 

T  =  257.41 

tl2  i  =  27?,.  00 

S  =  33398 

^1=530.00 

Table  6-13.  Near  Optimal  Solutions  for  Case  1  (Cont./ 
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0-family:  37 


Solution  1 

01*.°01O:  001010  0  000 

0i°  *  ’  0iO:  ooionoooo 

1  1  0  0  0  0  1  1  1  1 
j32°*°/31():  1  0  0  0  0  0  1  1  1 

kl"’k10:  1113  16  1111 

Solution  2 

01*--01O:  0010100000 

0i°  *  ’  0iO:  0  0  1  0  1  0  0  1  0  0 

A].° '  °  A10:  1  1  0  0  0  1  1  0  1  1 

^2 —  ^  0:  1  0  0  0  0  1  0  0  1 

kj» • *k10:  1113111611 

Common  Items 


m  =  1  m  =  18 

m  =  1  m 

a  p 

rl  r2 

tg  1=  122.00 

t?  1=  291.22 

T=  259.  14 

t1  =278.00 

1  Ci  j  1 

S  =34754 

t14  j  =  710.  GO 

Table  6-13 1  Near  Optimal  Solutions  for  Case  1  (Oont.) 


0-family;  46 
Solution  1 


V"*1C 

0011000000 

0011010000 

"o 

rH 

<1 

o 

• 

iH 

< 

1100101111 

100000111 

b*  •  o  •  Ir  • 

K1  K10‘ 

1113  16  1111 

Solution  2 

<V',0io: 

0011000000 

0011000100 

Al”'A10: 

1100111011 

^2* ' '  ,910: 

100001001 

kl"‘k10! 

1113  1116  11 

Common  Items 


m  =  1  m-18  m  =  1  m 

a  P  rl  r< 

t3  j  =  122.00 

t7  x  =  293.22  T  =  260 o  60 

tl2  x  =  274.00  S  =  33850 

^14  1  =  ^66. 00 


Table  6-13.  Near  Optimal  Solutions  lor  Case  1  (Cont.) 


tor  (^-families  19  and 


Solution  i 


Solution  for  (^-family  37  for  Case  1 


2 


Figure  6-13.  Best  Solution  f 


n 


If  we  examine  the  structures  cf  Figures  6-10  through  6-13  and 
compare  them  with  the  optimal  structure  of  Figure  6-3,  we  can  con 
elude  that  although  there  may  be  a  number  of  optimal  or  nearly  optimal 
solutions,  all  of  ihese  eolations  tend  to  cluster  about  the  optimal  solu¬ 
tion.-  That  is,  all  of  these  structures  are  basically  the  same,  differing 
only  in  relatively  minoi  respects. 

In  particular,  the  storage  s  ructures  which  we  have  considered 
here  all  contain  a  stack  of  target  description  block  indicators  for 
each  relation  symbol  associated  with  a  given  source.  Furthermore, 
these  stacks  of  description  block  indicators  and  their  respective 
relation  symbols  are  associated  with  a  given  source  via  a  ring  or 
stack  of  rings  such  that  the  overall  storage  structure  takes  the  form  of 
a  downward  branching  tree  (i.e. ,  from  source  to  target). 

6. 3  Sensitivity  Analysis 

Next  we  wish  to  determine  the  sensitivity  of  our  optimal  solution 
to  variations  in  the  problem  under  consideration.  In  particular,  we 
wish  to  determine  how  sensitive  our  solution  is  to  variations  in 
k°  •  •  *k  jq  and  to  variations  in  the  relative  frequencies  of  the  given 
prmitive  operations.  Since  the  values  of  these  parameters  must 
generally  be  determined  by  estimation,  they  are  subject  to  the  errors 
inherent  in  any  estimation  process.  We  would  hope,  therefore,  that 
the  solution  is  relatively  insensitive  to  minor  variations  in  these  values. 
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6. 3. 1  Variation  in  k°  •  •  *k°  ^ 

Let  us  first  consider  the  parameters  k ^  •  k^Q.  Recall  that  the 

values  used  thus  far  in  the  solution  of  the  diagnostic  problem  (i.e. , 
in  Case  1)  are  integer  approximations  to  the  values  we  determined  for 
the  heart  disease  problem.  (See  Tables  6-7  and  6-8.)  Table  6-14  con¬ 
tains  another  equally  valid  set  of  values  for  k°  •  •  *k^Q  in  approxi¬ 
mation  of  the  values  given  in  Table  6-7.  Let  us  now  determine  the 
optimal  solution  using  the  values  given  for  k^-  •  »k°  Q  in  Table  6-14 
and  keeping  the  values  of  all  other  parameters  (including  the  external 
decision  variables)  unchanged. 

Applying  our  program  to  these  data,  which  we  will  designate 
Case  2,  yields  the  solution  listing  contained  in  Appendix  F.  Figure 
6-14  contains  a  plot  of  the  time  costs  for  the  best  solutions  in  each  of 
the  various  0-families  and  Table  6-15  contains  descriptions  of  the 
optimal  and  near  optimal  (within  4  percent  of  the  optimum)  solutions 
in  order  of  increasing  time  cost.  As  was  the  case  before,  method  1 
is  the  best  choice  for  each  of  the  primitive  operations  for  all  solutions. 

As  in  the  previous  case,  0-families  28  and  10  contain  the  optimal 
solutions  and  0-families  4,  19,  37,  46,  and  55  contain  the  near  optimal 
solutions  with  the  same  relative  ordering.  In  addition,  0-family  1  also 
contains  a  near  optimal  solution. 

Once  again  we  find  that  all  of  the  (best)  solutions  within  a  given 
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113 


ko  =  113 

P 

ko  =  22 


m  =  4 

r 

P 


1 

1 


1 

3 

12 

1 


Table  6-14.  Alternative  Values  for  Parameters 
of  Heart  Disease  Problem 
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8 
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^-family 

Figure  6-14.  Time  Costs  by  ^-family  for  Case  2 


0-family:  28  (Optimal  Solutions) 


Solution  1 


V”0io: 

001 0000000 

d\*  •  •  ©  $  * 

TL  *10* 

0010000000 

Al'"A10: 

1001101111 

^2*  *  ‘  %o: 

000100111 

ki***kio: 

14  1113  11 

Solution  2 

V  •  •  01O: 

0010000000 

0l"'0io: 

0010000000 

VAio: 

1  0  0  1  1  1  1  0  1  t 

^2* ' '  %0: 

000101001 

"o 

i-H 

14  111113 

Common  Items 

m  =1 

m  =12  m  =  1 

a 

P  ri 

1  1 


1  1 


t3|1=  84*00 

t?  j  =  213.04 
tJ2  j  -  2C2.00 
tH  1=  344.00 


T  =  188.39 
S  =  22098 


Table  6-15.  Optimal  and  Near  Optimal  Solutions  for  Case  2 
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0-family:  10  (Optimal  Solutions) 

Solution  1 

01**.01Q:  0000100000 

•  0000100000 

HI  no1 

1011001111 
0  0  1  0  0  0  1  1  1 

kl”'ki0:  14  1113  1111 

Solution  2 


•  •  aio: 

0000100000 

•  •  (Sio: 

0000100000 

•-A10: 

1011011011 

•  •  ho- 

001001001 

"kio: 

14  111113  11 

Solution  3 

V*’01O:  0000100000 

0l'  * '  07O  :  0000100000 
Aj-'A^:  1  1  1  0  0  0  1  1  1  1 

02*  •  •  i31q:  1  1  0  0  0  0  1  1  1 

1114  13  1111 

Table  6-15.  Optimal  and  Near  Optimal  Solutions 
for  Case  2  (Cont. ) 

-  -9 


Solution  4 


V',(5io:  0000100000 

0i° 0  *  0io:  0000100000 

A1,,,A10:  1110011011 

h' ' 0  ho:  110001001 

Vk10:  1114  1113  11 

Common  Items 


rn  1 

m  =12 

m  = 

,1  m 

a 

P 

rl 

r2 

*3,1  = 

84.00 

ii 

c— 

213.04 

T  = 

188.39 

ti2,r 

202.00 

S  = 

22098 

*14,1“ 

344.00 

Table  6-15.  Opti  ml  tU  1  Near  Optimal  Solutions 
for  Jase  ‘  Cont.) 
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0-family:  19 
Solution  1 

V**01O:  0001000000 

0»o..0’o:  0001000000 

Al‘“A10:  1110  10  1111 

$2°  *  ‘  %0:  1  1  0  0  0  0  1  1  1 

V-klo'  1114131111 
Solution  2 

0  •  •  •  01©:  0001000000 

0i°°  ,0io:  0001000000 


A1-"A10: 

1  1  1  0  1  1  1  C  1  1 

<V ' '  @io: 

110001001 

]r  o  •  •  lr  • 

K1  K10’ 

1114  1113 

1  1 

Common  Items 

ma=l 

m  =12  m  =  1 

P  ri 

m  =1 
r2 

V  = 

86.00 

k7,l  = 

217.04  T  = 

191.88 

42,1" 

198.00  S  = 

20742 

t14,l  = 

236.  OC 

Table  6-15.  Optimal  and  Near  Optimal  Solutions 

for  Case  2  (Cont.) 
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p-family;  55 


Solution  1 


vv 

0100000000 

*i‘"* 10: 

0100000000 

Ai",Aio: 

1011101111 

%•  •  •  %0: 

001100111 

kl"k10: 

14  1113  11 

Solution  2 

V0io: 

0100000000 

0i"‘0io: 

0100000000 

Al'”A10: 

1011111011 

^2’ ' "  ^10: 

00110100? 

kl'"k10: 

14  111113 

1 


1 


Common  Items 


m  =1 

m  =12 

m  =  1  m 

a 

P 

rl  r2 

*3,1  " 

86.00 

V  = 

217.  04 

T  =  191.88 

*».r 

198.00 

S  =  20742 

ti4,r 

236.00 

Table  6-15.  Optimal  and  Near  Optimal 
Solutions  for  Cat.e  2  (Cont.) 


0-family:  4 


Solution  1 

0i —  01O:  0  0  0  0  0  0  1  0  0  0 

0;  •  •  •  0^o:  0  c  0  0  0  0  1  0  0  0 

Ai00*Aio:  1011110011 

•  •  (310:  0  0  1  1  0  0  0  0  1 

Vk10:  1  4  1  1  1  1  1  3  1  1 

Solution  2 

0^  ->  -  =  01 0 :  0000001000 

0i —  0io:  0000001000 

Al°  *  °  A10‘  1110110011 

^2*  *  *  %0:  1  1  0  0  0  0  0  0  1 

Vk10:  1114  11X311 

Common  Items 


m  =1 

m  =12 

m  =  1  in 

a 

P 

rl  r2 

11 

tH 

CO 

80.00 

*7,1  = 

221.04 

T  =  194.07 

ti2,r 

202.00 

S  =  22098 

ti4,r 

344.00 

Table  6-1 5.  Optimal  and  Near  Optimal  Solutions 
for  Case  2  (Cont.) 


0-family:  37 
Solution  1 

01*  •  •  01Q:  0010100000 

•  •  0^:  0010100000 

1100001111 
P2‘  *  °  j31Q:  1  0  0  0  0  0  1  1  1 

kl-k10:  H14  13  1111 

Solution  2 

0i* 0  *  0io:  0010100000 

0^**0’  :  0  0  1  0  1  0  0  0  0  0 

Al“‘A10:  1  1  0  0  0  1  1  0  1  1 

^2°  *  *  %0*  1  0  0  0  0  1  0  0  1 

kl"'k10:  1114  1113  11 


ommon  Items 

m  =1 
a 

m  =12 

P 

m  =1 

rl 

m  =  1 

r2 

V  = 

90.00 

V  = 

219.04 

T 

=  194.50 

ti2,r 

208.00 

S 

=  23454 

c-*~ 

h-i 

b—*- 

II 

464.00 

Table  6-15,  Optimal  and  Near  Optimal  Solutions 
for  Case  2  (Cont.) 
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0-family:  1 


01-°-0lo:  0000000000 
0i’  *  *  01O:  000100  0  000 

* 0 Aio:  1110101111 

P2°  •  •  /31q:  1  1  0  0  0  0  1  1  1 


V‘*k10:  1  1  1  4  1  3  1  1  1  1 


m  -1 
a 

m  =12 

P 

m  =1 
rl 

m 

r2 

*3,1  =: 

83.00 

*7,1  = 

221.04 

T  = 

194.51 

t12,l~ 

178.00 

S  = 

18482 

ii 

rH 

-4-> 

236.00 

Table  6-15,  Optimal  and  Near  Optimal 
Solutions  for  Case  2  (Cont. ) 
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0-family:  46 
Solution  1 


V’W 

0011000000 

0T  * '  0io: 

0011000000 

1  1  0  0  1  0  1  1  I  1 

hm  ’  ’  6o: 

100000111 

ki"'kio: 

1114  13  11 

1  1 

Solution  2 

V"0io: 

0011000000 

0i"'0io: 

0011000000 

41"‘A10: 

1100111011 

<V '  ‘  6o: 

100001001 

V"kio: 

1114  1113 

1  1 

Common  Items 

m  =  1 

m  -  12  m  =  1 

m  =: 

a 

P  *1 

r2 

6,i  = 

90.00 

6,i  = 

221.04  T  = 

196.00 

,= 

202.00 

12,1 

O  = 

22098 

*■14  1  = 

356.00 

f 


Table  6-15.  Optimal  and  Near  Optimal  Solutions 
for  Case  2  (Cont. ) 
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0-family  result  in  the  same  physical  storage  structure..  Moreover, 
the  structures  for  0-families  10  and  28  are  again  identical,  as  are 
the  structures  for  0-families  19  and  55.  Figures  6-15  through  6-20 
contain  schematic  representations  for  the  solutions  of  0-families 
28  (and  10),  19  (and  55),  4,  37,  1,  and  46,  respectively. 

Upon  close  inspection  of  the  storage  structures  for  0-families  28, 
19,  4,  37,  and  46  we  find  that  they  are  the  same  as  the  structures  ob¬ 
tained  for  those  0-families  in  the  previous  case  with  the  exception  that 
the  stacks  of  description  block  indicators  do  not  contain  head  pointers. 
The  lack  of  these  head  pointers  is  due,  of  course,  to  the  reduced 
number  of  description  block  indicators  in  each  stack. 

We  also  see  that  the  additional  near  optimal  solution  (in  0-family  1) 
has  the  same  basic  form  as  the  others.  Instead  of  having  a  ring  (or 
stack  of  rings)  or  stacks  of  description  block  indicators,  this  structure 
uses  a  stack  for  the  stacks  of  description  block  indicators. 

For  the  purposes  of  comparison  we  have  determined  the  time  of  the 
Case  1  optimal  solution  given  the  conditions  of  Case  2.  The  value  of 
this  time  cost  is  188. 41,  whereas  the  time  cost  of  the  Case  2  optimal 
solution  (given  the  conditions  of  Case  2)  is  188.39. 

Thu:  ,  we  may  conclude  that  our  solution  is  relatively  insensitive 
to  minor  variations  in  the  values  assigned  to  the  parameters  k° «  •  *k^Q. 
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Figure  6-16.  Best  Solution  for  ^-families  19  and  55  for  Case  2 
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Figure  6-20,  Best  Solution  for  6-family  46  for  Case  2 
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6.3.2  Variation  in  the  Frequencies  of  Operations 

Let  us  now  consider  the  effects  of  variation  in  the  relative  fre¬ 
quencies  of  the  given  primitive  operations.  Instead  of  the  relative 
frequencies  used  for  Case  1,  suppose  that  we  were  to  use  the 
following  values: 


Operation 

Relative  Frequency 

0.20 

Q,(di-Pk) 

0.70 

w-> 

0.05 

0.05 

These  values  tend  to  assign  more  weight  then  before  to  the  less 
significant  operations. 

Keeping  the  values  of  all  other  parameters  fixed  as  they  were  for 
the  Case  1  and  applying  our  program  to  these  data,  which  we  will 
designate  Case  3,  yields  the  solution  listing  contained  in  Appendix  G. 
Figure  6-21  contains  a  plot  of  the  time  costs  for  the  best  solutions  in 
each  of  the  various  0-familieso 

It  appears  for  this  case  that  the  optimal  solution  lies  in  a  some¬ 
what  sharper  valley  since  there  are  fewer  solutions  within  3  to  4  per¬ 
cent  of  the  optimum  (although  this  fact  may  not  be  overly  significant). 
We  see  once  again  chat  the  optimal  and  near  optimal  solutions  are 
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Figure  6-21.  Time  Costs  by  ^-family  for  Case  3 


concentrated  in  the  same  0-families,  as  before,  however.  In  parti¬ 
cular  the  0-families  containing  solutions  within  3  percent  of  the  opti¬ 
mum  are  4,  1C,  IS,  28,  and  55. 

We  see  also  that  these  solutions  are  identical  to  those  in  the 
corresponding  0-families  for  Case  1.  The  only  difference  is  in  the 
ordering  of  these  solutions  (by  time  cost).  For  this  case  0-families 
19  and  55  contain  the  optimal  solution,  followed  by  the  solutions  in 
0-families  10  and  28  and  in  0-family  4,  in  that  order. 

We  see  that  given  the  conditions  of  Case  3  the  Case  1  optimal 
solution  has  a  time  cost  of  262.  95  versus  a  time  cost  of  258. 85  for 
the  Case  3  optimal  solution. 

We  conclude,  therefore,  tha*  our  solution  is  also  relatively 
insensitive  to  minor  variations  in  the  relative  frequencies  assigned 
to  the  given  primitive  operations,  although  it  is  perhaps  more  sensi¬ 
tive  to  variations  in  the  values  of  these  parameters  than  to  variations 
in  the  /alues  of 

6.3,3  Solution  for  the  Primary  Operation 

We  may  find  it  instr  uctive  to  compare  the  solutions  we  have  obtained 
using  a  weighted  sum  of  the  time  costs  for  several  operations  and  aver¬ 
age  statistics  for  relations  witli  a  fairly  large  variance  with  the  solu¬ 
tions  we  might  obtain  for  a  single  primitive  operation  and  statistics 
for  only  tno  relations  with  which  that  operation  ic  intimately'  concerned. 


In  particular,  let  us  consider  the  primitive  operation  (which 
has  the  greatest  relative  frequency  in  our  other  examples)  and  the 
relations  r° •  •  •  r°Q.  We  can  easily  determine  that  the  relations 
r°  •  *  •  r°Q  will  result  in  the  statistics  of  Table  6-16,  (See  the  deriva¬ 
tion  of  the  results  of  Table  6-7  for  the  procedure  involved.)  Table 
b-17  contains  a  suitable  integer  approximation  of  these  statistics. 

Appendix  H  contains  the  solution  listing  for  this  problem  and 
Figure  6-22  contains  a  plot  of  the  time  costs  for  the  best  solutions  in 
the  various  0-families  „ 

The  optimal  storage  structure,  which  is  given  by  the  solutions  for 
0-families  10  and  28,  appears  in  Figure  6-23.  The  0-families  which 
contain  solutions  with  time  costs  within  3  percent  of  the  optimum  are, 
in  order  of  increasing  time  cost,  19,  55,  1,  37,  and  46. 

From  this  information  it  is  clear  that  in  our  earlier  examples 
operation  has,  because  of  its  weight,  a  very  marked  effect  upon 
deciding  what  the  optimal  structure  is. 

6. 4  Summary 

Let  us  conclude  this  example  by  summarizing  its  main  points. 
First,  we  have  snown  that  application  of  our  procedure  to  a  particular 
problem  can  result  in  the  specification  of  a  storage  structure  signifi¬ 
cantly  more  efficient  than  a  storage  structure  determined  by  other 
less  rigorous,  qualitative  methods. 
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Table  6- lb, 


k°  =  50 

a 

k°  =  33 

P 

k°r  =  2b 

m  =  2. 86 

r 
P 


i 

k°.  =  6.64 

4 


k°  =  2.  82 

k°  =  28.39 


1 


Characteristics  of  Relations  "  •  -  r° 
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Table  6-17.  Integei'  Approximation  of  Statistics 
in  Table  6-16. 
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Figure  6-22.  Time  Costs  by  ^-family  for  Operation  Q,, 


Figure  6-23.  Optimal  Solution  for  Primitive  Operation 
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Secondly,  we  have  illustrated  that  the  solution  obtained  is  relatively 
insensitive  to  minor  variations  in  the  values  of  the  parameters  k°*  *  ek°  ^ 
and  the  relative  frequencies  of  the  given  primitive  operations.  This  is 
of  particular  significance  since  the  estimation  of  these  values  may  be 
somewhat  difficult  for  problems  involving  large  numbers  of  data  items 
and  relations. 

Finally,  at  least  within  the  context  of  the  problem  considered,  we 
have  demonstrated  that  the  optimal  storage  structure  is  very  much 
dependent  upon  the  operation  having  the  greatest  weight.  This  infor¬ 
mation  may  be  of  importance  if  all  the  operations  to  be  used  in  con¬ 
junction  with  the  structure  are  not  known  with  complete  certainty  -  the 
primary  operation  (or  operations)  may  then  be  used  to  obtain  a  solu¬ 
tion  which  can  be  at  least  near  optimal  in  terms  of  the  overall  problem. 
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Chapter  VII 


CONCLUSION 

The  objective  of  the  research  reported  here  has  been  the  develop¬ 
ment  of  a  rigorous  quantitative  method  for  the  automatic  design  of 
optimal  storage  structures  for  the  representation  of  data  within  a 
computer  memory. 

To  accomplish  this  objective  we  partitioned  the  problem  into  four 
basic  parts.  First,  in  order  to  provide  a  framework  within  which  to 
work,  we  defined  a  relational  model  of  data  structure.  To  apply  this 
model  to  a  particular  problem,  the  user  (i.e. ,  the  problem  solver) 
must  determine  what  data  items  are  involved  in  the  solution  process 
he  has  chosen  and  what  relations  (of  accessibility)  relate  the  various 
data  items  to  one  another.  He  must  then  determine  the  (average) 
cardinalities  of  the  various  sets  which  we  have  defined  for  the  model. 
These  cardinalities  in  essence  give  us  a  measure  of  the  redundancy 
or  the  "share-ability”  of  the  relations  among  the  data. 

Second,  we  developed  a  model  tor  the  specification  of  the  storage 
structures  which  can  represent  an  arbitrary  data  structure  as  given  by 
the  data  structure  model.  This  storage  structure  model  consists 
primarily  of  a  set.  of  decision  variables,  which  are  used  to  specify  the 
structural  form  of  a  storage  structure,  and  a  set  of  parameters,  which 
are  used  to  chai aeterize  the  environment  (i.e. ,  the  computer)  in  which 
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the  storage  structure  is  to  exist. 

As  a  third  step,  we  defined  two  basic  measures  of  performance  - 
a  time  cost  function  and  a  storage  cost  function  -  for  use  in  comparing 
the  relative  merits  of  a  collection  of  storage  structures.  The  time 
cost  function  reflects  the  number  of  time  units  required  to  pexTorm 
certain  primitive  operations  using  a  particular  storage  structure, 
where  the  problem  solver  chooses  the  primitive  operations  of  interest 
from  a  standard  collection  c  these  primitive  operations  and  assigns 
weights  to  them  to  reflect  their  relative  importance  in  tne  problem 
solution.  The  storage  cost  function  simply  reflects  the  total  number 
of  storage  units  occupied  by  the  storage  structure  of  interest. 

Finally,  we  presented  a  procedure  which  (automatically)  compares 
the  time  and  storage  costs  for  those  storage  structures  which  are 
members  of  the  set  of  feasible  storage  structures  (for  a  given  data 
structure)  and  determines  the  storage  structure  for  which  these  costs 
satisfy  certain  optimality  conditions.  For  our  purposes,  a  storage 
structure  is  considered  to  be  optimal  if  it  minimizes  the  time  cost 
function,  subject  to  the  constraint  that  its  storage  cost  is  less  than 
some  bound. 

In  order  to  demonstrate  the  feasibility  and  the  effectiveness  of  the 
techniques  which  we  developed,  we  applied  them  to  a  problem  for 
which  a  solution  program  had  already  been  implemented  and  for  which 
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fairly  extensive  data  about  system  performance  were  available.  The 
results  we  obtained  were  enlightening  and  demonstrate  conclusively 
that  the  approach  we  have  taken  is  not  only  feasible,  but  desirable. 

First,  we  were  able  to  show  that  the  storage  structure  which  we 
determined  to  be  optimal  reduces  by  a  factor  of  approximately  2.5  the 
time  required  to  solve  a  typical  problem,  as  compared  with  the  storage 
structure  implemented  for  the  existing  system.  The  optimal  storage 
structure  also  requires  approximately  two -thirds  the  storage  required 
by  the  existing  storage  structure,  even  though  we  made  no  particular 
attempt  to  minimize  storage  and  assumed  in  fact  no  limit  on  the  stor¬ 
age  available.  Although  such  significant  improvement  in  system 
performance  may  not  always  result,  these  figures  do  give  some  indi¬ 
cation  of  the  potential  of  a  rigorous  approach  to  storage  structure 
design. 

Next  we  considered  the  "uniqueness"  of  the  optimal  storage  struc¬ 
ture,  The  results  we  obtained  indicate' that  there  are  a  number  of 
storage  structures  with  time  costs  within  3  or  4  percent  of  the  time 
cost  for  the  optimal  solution  (and  with  essentially  the  same  storage 
costs).  This  would  tend  to  indicate  that  there  is  no  single  storage 
structure  which  should  be  considered  the_  optimum,  but  rather  that 
there  are  a  number  of  near-optimal  storage  structures,  any  one  of 
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which  would  serve  as  well  as  the  others.  To  a  certain  extent,  this 
is  true.  We  found  on  the  other  hand,  however,  that  all  of  these  near- 
optimal  storage  structures  were  very  nearly  identical  to  the  storage 
structure  determined  to  be  the  optimum.  This  fact  tends  to  favor 
the  idea  of  a  single  storage  structure. 

We  also  considered  the  sensitivity  of  our  solution  to  variations 
in  the  values  of  those  parameters  most  subject  to  error  -  namely, 
the  a»  erage  values  of  the  cardinalities  of  the  data  structure  sets  and 
the  relative  frequencies  of  the  primitive  operations.  The  values  of 
these  parameters  must  be  estimated  by  the  problem  solver  and,  hence, 
are  subject  to  the  errors  inherent  in  any  estimation  process.  This 
will  be  particularly  true  for  the  cardinalities  of  the  sets  when  the  data 
structure  involves  very  large  numbers  of  data  items  and  relations. 

Our  results  indicate,  however,  that  the  solution  we  obtain  is  relatively 
insensitive  to  minor  variations  in  either  of  these  two  sets  of  para¬ 
meters. 

Finally,  we  examined  the  solution  obtained  by  considering  only  the 
primitive  operation  with  the  greatest  relative  frequency  (0.8  out  of  1.0) 
and  compared  this  result  with  the  previously  determined  solution.  On 
the  basis  of  this  comparison  we  concluded  that  when  all  the  primitive 
operations  for  a  particular  problem  are  not  known  with  a  great  degree 
of  certainty,  we  can  obtain  a  solution  which  may  be  very  nearly  optimal 
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by  considering  only  the  principal  primitive  operation  or  operations 
(which  are  presumably  known). 

To  further  investigate  the  sensitivity  of  the  results  of  our  storage 
structure  design  procedure,  it  might  be  desirable  to  consider  a  more 
extensive  variety  of  problems,  perhaps  a  broader  range  of  variation  in 
the  values  of  the  parameters  to  which  the  sensitivity  calculations  apply, 
and  possibly  variations  in  the  values  of  additional  parameters  (such  as 
the  relative  times  to  follow  a  pointer  and  to  step  through  a  stack).  We 
cannot  argue  with  the  potential  value  of  such  investigations.  We  can 
only  point  out  that  each  variation  in  the  problem  to  be  solved  requires 
performing  the  complete  solution  process,  which  for  our  prototype 
program  involves,  while  not  excessive,  a  nontrivial  amount  of  computer 
time.  In  particular,  solution  of  our  example  problem  involving  four 
primitive  operations  required  approximately  7.3  minutes  of  CPU  time 
on  the  IBM  360/67.  This  suggests  that  one  should  at  least  choose  his 
examples  and  the  values  of  his  parameters  for  these  investigations  in 
a  prudent  manner. 

While  on  the  subject  of  the  amount  of  computer  tine  required  by 
the  solution  process,  we  might  point  out  that  the  amount  of  time  re¬ 
quired  is  completely  independent  of  the  numbers  of  data  items  and 
relations  (and  the  corresponding  cardinalities  ox  the  data  structure  sets) 
and  is  a  function  primarily  of  the  number  of  primitive  operations 
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characterizing  the  problem  to  be  solved.  This  follows  directly,  of 
course,  from  the  nature  of  our  time  cost  function. 

Let  us  now  consider  some  of  the  strengths  and  weaknesses  of 
certain  design  considerations  employed  in  developing  our  procedure. 

In  order  to  apply  the  procedure  developed  here  to  his  particular 
problem,  the  user  must  specify  the  data  structure  which  characterizes 
his  problem.  This  involves  specifying  the  numbers  of  data  items  and 
relations  involved  in  his  problem,  as  well  as  the  average  cardinalities 
of  the  various  data  structure  sets. 

Before  he  can  even  begin  to  determine  these  quantities,  however, 
the  user  must  rigorously  define  the  solution  process  which  he  intends 
to  use  for  his  problem  and  in  doing  so  must  choose  the  relations  which 
will  characterize  his  data  structure.  By  assuming  that  we  must  be 
given  the  data  structure  specification  as  the  starting  point  of  our 
storage  structure  design  procedure,  we  have  completely  ignored  this 
facet  cl  the  user’s  system  design  problem.  Specification  of  a  solution 
process  is,  of  course,  another  of  the  areas  of  computer  systems 
design  for  which  there  is  a  paucity  of  formal,  objective  techniques  for 
evaluation  and  comparison  of  the  effects  of  alternative  decisions.  Since 
the  storage  structures  specified  by  our  procedure  are  strongly  influenced 
by  the  user's  choice  of  relations  (i.e„ ,  y  his  choice  of  solution  process), 
this  matter  is  clearly  of  importance  to  us  in  spite  of  our  assumed 
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disregard  for  it. 

Let  us  discuss  for  a  moment  the  implications  of  using  average 
values  for  the  cardinalities  of  the  various  data  structure  sets.  In  the 
first  place,  using  exact  values  for  these  cardinalities  is  completely 
out  of  the  question  in  all  but  the  simplest  of  situations.  Determining 
exact  values  would  be  tantamount  to  constructing  a  schematic  of  the 
entire  data  structure  for  a  given  problem,  and  applying  our  procedure 
to  a  data  structure  model  utilizing  exact  values  would  essentially  be 
equivalent  to  completely  solving  the  user's  problem  for  every  storage 
structure  considered.  Clearly,  even  for  problems  of  moderate  size, 
neither  of  these  steps  can  be  accomplished  through  a  reasonable 
amount  of  effort. 

On  the  other  hand,  determining  average  values  for  the  cardinali¬ 
ties  can  be  done  quite  easily  (as  we  have  shown  in  our  example). 
Furthermore,  using  average  values  tends  to  minimize  the  amount  of 
information  hich  must  be  supplied  by  the  user. 

One  might  criticize  the  use  of  average  values  if  the  variances  of 
the  various  cardinalities  are  large,  but  it  may  be  possible  in  .,ach  cases 
to  partition  the  problem  (i.e. ,  the  dau  structure)  into  sub-problems 
for  which  the  variances  are  not  so  great,  and  then  to  determine  the 
optimal  storage  structure  for  each  of  these  sub-problems. 

Closely  allied  with  the  specification  of  the  relations  and  the  set 
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cardinalities  for  the  data  structure  is  the  selection  of  primitive 
operations  and  their  relative  frequencies.  Since  we  have  provided 
only  a  limited  number  of  primitive  operations,  the  user  may  encoun¬ 
ter  some  difficulty  in  trying  to  match  exactly  his  operations  with  the 
primitive  operations.  The  obvious  solution  to  this  problem  is.  of 
course,  to  define  additional  primitive  operations.  This  may  or  may 
not  be  desirable,  depending  upon  how  great  the  mismatch  is. 

Perhaps  the  biggest  weakness  in  our  consideration  of  the  storage 
structure  problem  is  the  neglect  of  manipulative  operations.  Most 
certainly  the  updating  and  alteration  of  the  information  contained  in  a 
storage  structure  can  have  a  profound  effect  upon  the  form  of  that 
storage  structure.  Again,  the  obvious  solution  tr  this  problem  is 
simply  to  define  additional  primitive  operations  to  cover  the  operations 
desired.  This  can  certainly  be  done,  but  development  ot  the  corre¬ 
sponding  time  cost£  would  definitely  require  a  nontrivial  amount  of 
effort. 

We  hasten  to  point  out  that  there  are  a  multitude  of  problems  which 
involve  only  the  interrogate  n  of  a  static  uata  base.  Clearly,  our  pro¬ 
cedure  can  be  applied  to  these  problems  without  the  necessity  of 
adding  manipulative  operations  to  oui  set  >f  primitive  operations. 

One  other  matter  with  which  one.  might  take  issue  is  our  choice  of 
optimality  conditions.  Although  other  choices  may  bt  made,  we  feel 
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that  minimization  of  the  time  cost  function  subject  to  a  limit  on  the 
storage  available  is  the  most  universally  applicable. 

Finally,  we  have  assumed  that  the  storage  structures  considered 
by  our  design  procedure  are  to  reside  in  the  primary  store  of  the 
computer.  By  introducing  appropriate  constraints  and  by  choosing 
proper  values  for  the  parameters  representing  time  cost  characteris¬ 
tics,  it  should  rlso  be  possible  to  apply  the  model  developed  here  to 
problems  for  which  the  storage  structures  are  tr  reside  in  the  secondary 
store  or  a  hierarchy  of  storage  devices. 
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METHODS  OF  IMPLEMENTATION  FOR 
THE  PRIMITIVE  OPERATIONS 


Operation  Q..:  d.  r.  p 
j.  1  3  K 

Method  1 


Search 

Search 

If  r.  is 
3 


for  source  d.. 

i 

for  among  the  associated  relation  symbols, 
found,  search  for  p^  among  the  associated  targets. 


Method  2 


Search  for  relation  r.. 

3 

Search  for  di  among  the  associated  sources. 

If  d.  is  found,  search  for  p^  among  the  associated  targets. 

Method  3 


Search  for  relation  r.. 

3 

Se^v-h  for  p^  among  the  associated  targets. 

If  p^  is  found,  search  for  d.  among  the  associated  sources. 

Method  4 


Search 

Search 

If  r.  is 
3 


for  target  pk< 

for  r.  among  the  associated  relation  symbols. 

3 

found,  search  for  d.  among  the  associated  sources. 
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Operation  Q0:  d.  r.  * 

Ct  11 

Method  1 

Search  for  source  d^ 

Search  for  r.  among  the  associated  relation  symbols. 
Method  2 

Search  for  relation  r.. 

] 

Search  for  d^  among  the  associated  sources. 


Operation  Q^:  di  r  - 
Method  1 

Search  for  source  d^. 

Search  for  r.  among  the  associated  relation  symbols. 
Determine  all  associated  targets. 

Method  2 

Search  for  relation  r.. 

1 

Search  for  d.  among  the  associated  sources. 
Determine  all  associated  targets. 

Method  3 


Determine  all  targets. 

For  each  target,  search  for  r.  among  the  associated 
relation  symbols. 

If  r.  is  found,  search  for  d.  among  the  associated  sources. 
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Operation  Q^:  *  r. 

Method  1 

Search  for  relation  r.. 

3 

Search  for  p^  among  the  associated  targets. 
Method  2 


Search  for  target  p^ 


Search  for  r.  among  the  associated  relation  symbols. 


Operation  Q,.:  -  r  p^ 

Method  1 

Search  for  relation  r.. 

3 

Search  fur  p^  among  the  associated  targets. 

Determine  all  associated  sources. 

Method  2 

Search  for  target  p^. 

Search  for  r.  among  the  associated  relation  symbols. 
Determine  all  associated  sources. 

Method  3 

Determine  all  sources. 

For  each  source,  search  for  r.  among  the  associated 
relation  symbols.  ^ 

If  r.  is  found,  search  for  p,  among  the  associated  targets. 
1  K 
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Operation  Qg:  d.  * 


Method  1 

Search  for  source  d^. 

Search  for  p^  among  the  associated  targets. 
Method  2 

Search  for  target  p^. 

Search  for  d^  among  the  associated  sources. 


Operation  Q^:  d.  -  p^ 

Method  1 

Search  for  source  d.. 

X 

Determine  all  associated  relation  symbols. 

For  each  relation  symbol,  search  for  p,  among  the 
associated  targets. 

Method  2 

Search  for  target  p^. 

Determine  all  associated  relation  symbols. 

For  each  relation  symbol,  search  for  d.  among  the 
associated  sources.  1 

Method  3 

Determine  all  relations.; 

For  each  relation,  search  for  source  d.. 

’  i 

If  d.  is  found,  search  for  p,  among  the  associated  targets. 

1  K 

Method  4 


Determine  all  relations. 

For  each  relation,  search  for  target  p^. 

If  p,  is  found,  search  for  d  among  the  associated  sour 

3  o  5 


ce 


Operation  Q0:  -  r.  * 

°  ] 

Method  1 

Search  for  relation  r.. 

3 

Determine  all  associated  sources. 

Method  2 

Determine  all  sources. 

For  each  source,  search  for  r.  among  the  associated 
relation  symbols.  ^ 


Operation  Q^:  *  r.  - 
Method  1 

Search  for  relation  r.. 

j 

Determine  all  associated  targets. 

Method  2 

Determine  all  targets. 

For  each  target,  search  for  r.  among  me  associated 
relation  symbols. 


Operation  :  -  r.  - 
Method  1 

Search  for  relation  r.. 

3 

Determine  all  associated  sources, 
Determine  all  associated  targets. 


Method  2 


Search  for  relation  r.. 

J 

Determine  all  associated  targets. 

Determine  all  associated  sources. 

Method  3 

Determine  all  sources. 

For  each  source,  search  for  r.  among  the  associated 
relation  symbols.  ^ 

If  r .  is  found,  determine  all  associated  targets. 
Method  4 

Determine  all  targets. 

For  each  target,  search  for  r.  among  the  associated 
relation  symbols.  ^ 

If  r.  is  found,  determine  all  associated  sources. 

3 

Operation  Q^:  d^  -  * 

Method  1 

Search  for  source  d.. 

i 

Determine  all  associated  relation  symbols. 

Method  2 

Determine  aJ  relations. 

For  each  relation,  search  for  d.  among  the  associated 
sources.  1 
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Operation  Q12:  d,  *  - 
Method  1 

Search  for  source  d.. 

1 

Determine  all  associated  targets. 

Method  2 

Determine  all  targets. 

For  each  target,  search  for  d.  among  the  associated 
sources. 

f 

Operation  Q^:  d.  -  - 
Method  1 

Search  for  souice  d. . 

i 

Determine  all  associated  reh  on  symbols. 

Determine  all  associated  targets. 

Method  2 

Determine  all  relations. 

For  each  relation,  search  for  d.  among  the  associated 
sources. 

If  d.  is  found,  determine  all  associated  targets. 

Operation  Q^':  -  *  p^ 

Method  1 

Search  for  target  p  , 

Determine  all  associated  sources. 


"  7)8 


Method  2 


Determine  all  sources. 

For  each  source,  search  for  among  the  associated 
targets. 

Operation  Q15:  *  -  pk 
Method  1 

Search  for  target  pfe. 

Determine  all  associated  relation  symbols. 

Method  2 

Determine  all  relations. 

For  each  relation,  search  for  p,  among  the  associated 
targets. 

+ 

Operation  O^*  -  - 
Method  1 

Search  for  target  p^. 

Determine  all  associated  relation  symbols. 

Determine  all  associated  sources. 

Method  2 

Determine  all  relations. 

For  each  relation,  search  for  among  the  associated 
targets. 

If  p^  is  found,  determine  all  associated  sources. 

i- 

Two  additional  methods  could  be  defined  for  each  of  these  operations 
(Qj2  and  Q  J  but  were  judged  always  to  be  more  costly  than  the  methods 
given  here  \ since  each  involves  searching  the  entire  storage  structure) 
and  are,  therefore,  not  considered.; 
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Appendix  B 


TABULATION  OF  e.  FOR  i  e  {1,2,  —  ,  10} 

Each  of  the  functions  e .  for  i  e  {1 , 2,  •  *  • ,  10}  is  presented 
here  in  tabular  form  as  a  function  of  g, .  < .  Ag, . .  • ,  Ajq  . 

The  variables  for  which  values  are  unspecified  in  a  given  row 
of  a  table  are  free  to  assume  either  the  value  1  or  the  value  0 
(subject,  of  course,  to  the  constraints  of  Chapter  HI).  Note,  however, 
that  values  specified  for  2>  •  •  ■  >$10  Per*ain  to  all  succeeding  rows 
of  a  table  (even  though  not  explicitly  specified)  until  new  values  are 
assigned  to  them,, 

For  all  (legal)  combinations  of  values  not  covered  by  its 
corresponding  table  a  function  has  the  value  0„ 
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Table  B-l„  Elementary  Time  Cost  e 
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Table  B-E  Elementary  Time  Cost  e„  (Cont.) 
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Table  t> -1  Elementary  Time  Cost  e_  (Cont.) 
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Table  B-2.  Elementary  Time  Cost  e 
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Table  B-3.  Elementary  Time  Cost  e 


Table  B-4.  Elementary  Time  Cost  e 
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Taule  B-b.  Elementary  Time  Cost  e 
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Table  B-6.  Elementary  Time  Cost  e 
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Table  B-6.  Elementary  Time  Cost  eQ  (Cont.) 
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Table  B-6.  Elementary  Time  Cost  eQ  (Cont.) 
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Table  B--7.  E*eme.itarv  Time  Cost  e 
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Table  B-7 .  Elementary  Time  Cost  e„  (Cont.) 
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Table  E  8.  Element  ar>  Time  Cost  e 
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Table  B-9.  Elementary  Time  Cost  e 
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Table  B-10.  Elementary  Time  Cost  e 


Appendix  C 


SUMMARY  OF  z.(t)  and  z*(t)  FOR  i  €  [l ,  z,  •  •  • ,  10} 

zl^  =e6  +  e5  +K33  f  e5+X4(e4  +  e3)1 
K32  f  X4e3  +  X2  ^e2  fel^ 

K31  f  X2el  +  +  Z1 

where 

Zj  -{(W6V^)Vi6%VA6A5W4V^W---V  V5A4Wi} 

+  (^6  V  V(5  A5<^*4  V4V  ’  ’  v'V6ASA4Wl> 


+^33^  V^A^^y^A 4^3  ( $2^$  V"  *  ’  V^A^AgAgA^) 


+(0404<^V'/0404A3v^V  0g)W>4^4  AgA^  V  ^  AgA^)] 


+K32  [  (03(02V02)V03A201V03A2A1) 


(02^2  01V0202A1)] 


z2(*)  =e°  +  K33[  ^  V<  +  e3°>] 

+  K32t  X4e3  +  X2*e2  ,el^ 

+  K31  [  Xfi  +  t]  +  *» 

where 

z2  =  (( VA5(04V04  >VA5A403VA5A4A3(02V02 )V' ' ' V  A5A4A3A2Al' 

+  k33  [  (05(04v  0;)  v05a403v05a4a3(02V0'2)V-  •  •  v'05a4a3a2a1) 

+(*)40403v04'04  A3<O2/02'  >,0404  A3A201,^0404A3A2A1^ 

+  K32[(03(02V0')V03A20iV03A2A1) 

+  (020'201V020^A1)] 

+  K31  r  0ll  >so 
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z3(t)  =  e5  +  e6  +  K34f  e6+VeVe8^ 

+  K35  t  X7e8  +  Ve  9  +  e  10^ 

+  K36  [  AgeJ0  +  t  ]  +  z° 

where 

z°3  =  ((( WVA506VA5A6(0'/y  9  V*  ‘  ‘ '  A5A6A7A8A9A10) 

+  (05W05^5A6(07V^  V/0505A6A708V*“*V05^5A6A7A8A9AlO^ 

+  K34  [  (06(07v07)V06A708V06A7A8(09\  0g) ..  •  •  •  „06A7A8A9A1o) 

' ^7W  0707A8(09V0y V0707A8A901OV  WsVlO^ 

+  K35^  ^08^09V0yV08A90lOV08A9AlO^ 

+  (0900010^  0909  A10^ 

+  K36  f  01O^  ^  So 
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z4(t)  =e°6  +  K34f  WV6^ 


+  Kg5  [X?eJ  +  Xg(e*  +e°Q)] 
K36  t  X9e10  +  ^  +z4 


where 

z4  ■  { <*6  V  «>7V^)  VA8A708VA6A7A8(SS9V0J,  )V-  •  •  va6a,a8a9a10) 

+  K34  [  (06( tflltj)  ,  06A708V06A7A8(v'9V0PV  •  -\*I6A7A8A9A10) 
+  (07^08'/07^a8(09v0^)V07?’ a8a901o/07?7a8a9a1o)] 

+  K35  [  V08A9 01OY08A9A1(P 


+  K36  f  0iol  >so 
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z5(t)  =  (mr  +  1)  Fr  +  mr  z2(t) 


z6(t)  =  (mr+l)  Fr  +  mr  z4(t) 

z°  =  m  z°. 

6  r  4 


380 


z7(t)  -  (ay-l)  Fa  +  ma{e*  +  e° 

+  K12^  e2  +Ve3  +  e4'j 

+  K13  [  X3e^  +  t]  }  +  z° 

where 

z°?  =ma  {((01V0pVA102VA1A2(03\/0j)VA1A2A304 

^A1A2A3A4^/05^05^A1A2A3A4A501^ 

+  (VlV¥lA2(,}3Y  03)V0101  A2A3P4 

v  01^a2a3a4(p1v  05  v0y  v0101a2a3a4a5p1) 

+  K12  t  ^02^03N/03‘*  v02A3  04  ^  02A3  A4  ^/02A3  A4  A  5^  1^ 

+  ( 03  0g  04\/  03  A 4  (p ^5^5)  ^3^3  A4A5^i'^ 

+  k13  L  (P4(Piv05^’5)\'^4A5Pi) 

+  }  So 
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Z8(t)  =(mp  +  l)Fp  +  mp{e*0  +  e°8 

+  K55  t  e9  +Ve*8  +e?' 
+  K54[X8e'+t]}+z» 


where 


z8  *  mp  t«  0!O''1)io)'/AlO09VAlOA9(08V08)'/AlOA9A807 
ya10A9A8A7  ^3'/06V06'VAlOA9A8A7AePp 

+  (0lO0lo 

y  01O01O  A9A8A7^06v06*  V0lO^io  AD  a8A7A6®3* 


+  K55f  (09(08V08)V09A807V09A8A7<03V'WV09A8A7Aep3) 

+  (08W  \  08  N^W  V0808  V^l 


+  K54  t  (07<p3V06V06)V07A8p3) 

+  <  K 
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zt(t)  *  (ma  +  1)  Fa  +  mje*  +  e° 

+  k12[  e2  +  X3(e*3  +  e°)] 

+  Kl3tV4  +  X5(e*5+e6)] 

+  Ku  [  X5e'6  +  X7(e*,+  e°8)] 

+  K15  t  X7°8  +  X9(e  9  +  e  10^ 

+  K16  t  VlO  +  *  1 }  +  z9 

where 

z9  =  ma  {<(01v/«s1)va102v^1a2(«s^0^)v- •  "'',a1a2a3a4a5a6a7a(1a9a10) 
+(01«»-«(2/01ijA2(03V0^...V010jA2A3A4A5A6A7A8A9Alo) 

+  K12  I  V- ' -V  W4  VfftWlfl' 

+(03?304V03^A4'05V05)'/'  ‘ '  v030'3A4A5A6A7A8A9Alo'  ^ 

+  k13  [(04(05V0'5)v04a506v04a5a6(0^0^)v  •  •  v04a5a6a,a8a9a1o) 
+<050506’ 0505a6(07n  tyv  •  •  •  v  050^a6a7a8a9a1o)] 

+  k14  t  <0«(  08v06A7A8(03v 09V V‘  ’ -  Y06A7A8A9AlOi 

+(070^0eV070^A8(09\0y,  •  •  -X  070^A3A9A1O)] 

+  K15[(08(09V0')V08A90iOV08A9aiO) 

+  ('49090iov  0909A1O^ 

+  K16[  V>So 
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Z10(t)  =  <mp  +  V  FP+  mp{e  10+  e°9 

+  iW  e9+X8(e8  +  e7^ 

+  K54  f  X8e7  +  Ve*6  +  e5^ 

+  K53  [  X6e5  +  X4^e4  +  e3^ 

+  K52  f  *4e3  +  X2  2  +  6 1^ 

+  K5ltX26l+t'}  +z°10 

where 

Z10  =  mp  ^((0lOV0io)VAlO09YAlOa9(08V08)v-  •  •  VAUJVWW!|Wl) 
+<0lO0lO09V0lO0ioA9(08V08)V-  ’  ” V  VloWA&WA) 

+  Kgg  [  (09(08^8)V'09A807','09A8A; 06> A '  "'WWjWl1 
+(080807V0B08A7(06'  06K ' - 

+  K54  t  (07(06/06)V07A605/07A6A5(^04)v  '  * '  "'VrfWWV 
+(060605'  0606A5(04V  04>  V-  ’  •''06?6A5A4A3A2A1,J 

+  K53  f  !05(04V04>  105a403  *' 05 a4 A3 <02V02)V  '  ’  V05A4A3A2A1> 

+  (04  0403V 040;V02V02>V- •  •  V®404A3A2Ai)] 

+  K52[(03(02V0yv03A201V03A2A1) 

+  (020’201V0202A1)] 

+  Kgi[01!)so 
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z*(t)  =  e*g  +  e°  +  (K3rl)  ej.  +  [  X^-l)  +  1  ]  (e*4  +  e°3) 

+  (X4KS2  +  I4K22-1)  e’  +  [  X2(fe32-1)  +  1]  (e*2  +  e°) 

+  (XgKgj  +  XJ^j-1)  ex  +  K31  t  +  z° 

z  2  (t)  =  ^f5  +  (K33-l)  e^  +  [  X^33-l)  +  1]  (e*  +  e°3) 

+  (X4K32  +  X4  K22-l)  ej  +  [  X2  (K32  -1)  +  1]  (e*2  +  e°) 
+  (X2K31  +  X2  Ku-1)  e1  +  K31t  +  z°2 

z*(t)  =  e*5  +  e-  +  (ft34- 1)  e^  +  [  X,  <$34-l)  +  1]  (e*  +  e°) 

+  ^X7K35  +  \'K45~^  e8  +  ^  VK35-1^  +  ^  ^e9  +  el(P 

+  (XqK36  +  Xq  K56  -1)  ei0  +  K36 1  +  Z3 


=  e°6  +  (k34-l)  eg  +  [  X7(fe34-1)  +  1]  (e*  +  e°8) 
+  (X7K35  +  X7  K45-l)  eg  +  [  Xg(K35-l)  +  1]  (e  9 

+  ^X9^36  +  X9  K56-1^  e10  +  ^36  1  +  Z°4 


The  expressions  for  z°,  z2,  zg  ,  and  z^*  are  the  same  a3 
those  for  z°  ,  z°2  ,z°  and  z^  ,  respectively,  with  the  exception 

A 

that  K. .  is  replaced  by  K  .  wherever  it  occurs, 
i]  0 
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z*5  <‘>  » “r  (Fr+c5>  +  ("OW1)  e5  +  t  VC^S"1)*1]  <e4  +e3> 
+  (X4mrK32+^4^22'^  e3  +  t  ’)  +  1]  (eVel' 

+  )x2“Xl  +  x2Kir1>  el  +  “CK31  *  +  1 5 

z6<‘>  *“r  ,Fr+e6>  +  <nCK34-lle6  +  I  +  ^  (eVe8> 

+  (X7m^k35+X7K45-1)  e8  +  [  Xg(mrk35-1)  +  1]  (eg  +e^0) 

+  fXg m rKg s+XgK - 1)  elg  +  “j.Kjj  t  +  z\ 

z7  =  “a(Fa  +  e*l  +  6  2>  *<maiW1>  e2  +  t  X3(maK12-,)+1l 
+  (>3maK13  +  ^3K23-1)  e4  +  ma^13  *  +  z^ 


A 


Z8 (t)  =  “p  <Fp^lO+eP  +  ^35^9  +  r  X8<VS5-1)+1J  (e>°7) 


+  (X8mpK54+X8^‘44_1)  ^7  +  mpK54  1  +  Z8 


P1 


A 


A 


z9(t)  =  ma<Vel  +  e2>  + (maKl2'1)^ 6 2  +f  V'W1*  +1K**3  +  e°4) 

+  (x3 maKi 3^3^23 ” X)  e4  +  t  X5^T1aK13"1^+l  j  ^e5  +  e°6^ 

+  (X5maK144X5^34-l)  eg  +  [  (maK14-l)+lj  (e  7  +  eg) 


•  A 

a  14 

+  (X7raaKj3+X^K4^-l)  +  [ XgCm^Kjg-lJ+l]  (e^  +  e°Q) 


\ 


+  (xgmaK16+VC58"^  e10  +  t  +  z°*Q 
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*10<‘)  =  mp(Fp+e+10  +  eO)  +  <mpK55-l)  e’+[  (e*8+e°) 

+  (^8  V54+ V44-1)  e7+t  +1]  (G  6  +  e5  > 

+  (X6mpK53+X6K33-l)  e’  +  [  (e*4  +e°3) 

+  (X4mpK52+X4  K22-l)  ej  +  [  X2(mpK52-l)  +1]  (e  2  +e<|) 

+  (X2mpK51  +  X2KilL-l)  +  mpK51  t  +  z°*0 

The  expressions  for  z°*,  z°*  ,  z°* ,  z0*,  z°*  ,  and  z^*  are  the 
same  as  those  for  z05,  z°6,  z°„,  z°8,  z°9,  and  2^,  respectively,  with 
the  exception  that  ma,  mr,  mp  m^.,  mrK„,  and  m^.  are 
replaced  by  ma,  mr,  mp,  mV,  m^k..,  and  mV,  respectively, 
wherever  they  occur. 
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Appendix  D 


SUMMARY  OF  TIME  COSTS  FOR  PRIMITIVE  OPERATIONS 


tl,l  =  Ta  +  (1“xl)  z7(cr  } 

+  Xj(1-x4){(56+A8) z *  (Cr  HA6A8z7(Cr^)+m r  f>1*3(Cp)4p1z4(Cp)]  } 

+f  Pj^fCpJ^z^Gp)]} 

*1,2  ■  V(1-Xl)^ca) 

«!  «  -*4)  [  ( VS8)Z5 (C  a'  +A6A8Z  5(C  a' ,mr  ^4 1 (Cp» 

+xlx4f  ZS  (C^+<  ™r1-1)z4<Cp)-z4<Cp>] 


»l>rTr+<1-x2>z6(Cp) 

+x2<1*I£5'^<A3+^>z6(c^(V3A^z6(CP'+mz  z2(Ca)l 

+x2*5t  z*6(Cp)+(^z  ■1)z?(Ca'+4(Ca)! 


W(I-X2>Z8(Cr,' 

4 

VAi)z8(<i2U(  Y3^ZS{Ct  )+mz>3zl<Ca)^z2(Ca)l  } 


P 


+X^Z}CT^A  ■%ilcaHY2(ca)HPs»;(c,)4fJ!j(cJ)]} 
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t2,l=V(1-Xl)z7(Cr1''+xlz*7(Cr1> 

‘2,2  =V(1'Xl'Z5(Ca)+x1z5(Ca) 

<3,1  =V<VS8)z*7<Cr1)+A6A8z7pr1’+mr1['>lz3(V^lZ4(V1 


*3,2  =  MW  25(Ca)tA6i8z5(Ca)+mr1z4(Vp) 

*3,3  =kp°Tp+kp^l-X2)Z8<Cr2' 

+x2(l-x5){6  [P3zl(Ca'+^3z2(Ca)]  } 

^  2  2  p 

+x2x5^Z8(Cr  )+(™z  "1HP3zi^ca^3z2(Ca^ +^Zl(Ca)+^3Z2(Ca^  ^ 

2  p 


t  ,  +  m  K„ftV 
1,4  rx  36  p 


t4,rTr+(1-X2)Z6<Cp)+X2Z6(Cp) 


t4,2=Tp  +  (1-X2)Z8(Cr2)+X2Z8<Cr 
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%i  *  vy  v*5>  5p:v^  vcP)+raz  z2(va> 

P 


‘5,2  =Tp+V V^S  <Cr  M  ^3A5)z8(Cr2)+mzp[,,3zl(Vr>+%z2(Va)] 


tM,kiVka^-*lNPr1) 

+Xj(t-x4){fi6+Z^;CrHa6Ae*7(Cri)+mri[p1*3(Cp)-v1z4(Cp)]} 

+XlX4^Z7^r  ^+^mr  ‘iJf^lz3^p^+,5lz4^o^  +^lz3^p^+Al  z4  vLp^  J 


+  mz  K31Va 
P 

a+m  Koi  V 
a  1,1  z  31  a 
P 


V’v^-vvy^vy 


*6,2-y«-x7)z10<Ca)+x7zlVCa) 

t7,l=Vz7(VrJ!+maKi3^1-x4)[Plz3(Cp)+Plz4!Cp)-vc] 

+VplZ3(Cp)+^Z4(Cp)1} 

‘7,2  "  Wvr  >  +mpK54{(1-x5)t'33zl(Ca)  +53z2(Ca>  ’Vr) 

i 

«6[p3zj(ca)+p!|z*(ca)]} 
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,V=k0rTr*k°r<i‘-sl'VC1> 

+xx(l-x4)[  <  VS8>z5(Ca)+ Vez5  (Ca)+mr^z4(Cp)] 

+xlx4  [  z*g(Ca)+(mri-Dz4(Cp)«*(Cp)+vr] } 

.0  ,  ,0 

=  kr  ti,2+XlX4kr  vr 

+x2(i-x5)[  6p(S3+A5;z*6  (CpW5jA3A5)z6(Cp)+mz  z2(Ca)] 

Jr 

«2*6[  z*6(CpW”z  ->)22<Ca)+z?  (Ca>+vrl  } 

^  P 

=k?VV5k?vr 


8,1 


=  Vz5<Va> 


.  0  ,  .0 
=ka  t2,l+xlka 


+  Z6(V 
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V  =kpVkp°f(1-X2>Z8(Cr„'+!t2[  ZI  <Cr>Vol  ‘ 
=kpH2+X^pVp 


2  “  u  1 2  p 


V'  Vz5(Va)+mrK31z4(Vp> 

*10,2  =Tr+z6(Vp)+mrK36z2(Va) 

*10,3  =kaTa+ka^(1‘xl)z;7(Cr1)+xl[  Z7(C r j )+Va+pl z3 ( Vp'+Pl z4( Vp>  1  < 

*10,4  =  kpTp+kp{(1'X2)z8(Cr  >+x2[  z8(Cr z2<Va»  > 

^  2 

‘11,1=  VZ7<V 

*11,2=  krTr+k?^1'xl>z5*Ca'+xlf  VCa'+vrl ' 


■u,i 


■  V 


Z0(Vp> 
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*12,2 =^Tp*P{(1-X7)Z10<Ca>+z7t  <0(Ca>+V  > 

43,1  =  Ta+z9(V+k2k4Vr1 

*13,2  =  k°Tr+k“  {(l-kjJZjtC^+Xjf  z'5(Ca)+vr+z4(Vp)] } 


*14,1  ■  Tp+z10(Va> 


*14,2  -  kaTa+ka  '1'x1*z9  *Cp'+X^  Z9**V+V^  * 


*  =  Tr,+ZH<Vr  1 

15,1  P  8  r2 


*15,2  =  krTr+kr  ^1"x2^z6^Cp^+X2^  Z6  ^ 


46,1  =  Tp+ZlC(Va)+(k7  k>r>\ 

P  -* 


t16,2  =  k?Tr+k^(1'X2)z6(Cp)+X2t  z*6  <cp>+ VZ2(Va> '  } 
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Appendix  E 


SOLUTION  LISTING  FOR  CASE  1 
OF  THE  MEDICAL  DIAGNOSIS  PROBLEM 

In  order  to  interpret  the  notation  used  in  listing  the  best  solu¬ 
tions  within  each  Vfamily,  the  following  comments  may  be  helpful. 

The  ten  digits  following  a  character  string  of  the  form 

**PF(I) : 

represent  the  values  of  0^. .  .9^  for  0-family  I,  where  the  value  of 
I  simply  indicates  the  position  of  the  0-family  in  the  sequence  of  0- 
families  generated  and  is  used  to  simplify  distinguishing  the  various 
^-families  when  referring  to  them. 

The  number  associated  with  COST  is  the  value  of  the  time  cost 
function  for  the  best  solution  (or  solutions)  within  the  given  0-family, 
and  the  number  associated  with  CPU  reflects  the  cummulative  CPU 
time  in  seconds  used  to  that  point  in  vhe  solution  process. 

Following  the  line  containing  COST  and  CPU  for  a  given  0-family 
are  one  or  more  pairs  of  lines  representing  the  best  solution  (or  solu¬ 
tions,  if  there  is  more  than  one  pair  of  these  lines)  within  the  0-family. 
The  fiist  line  of  the  pair  contains  values  for  0j. .  .0^,0^. .  .  0^, 

Ay  . .  Ajq,  and  in  that  order,  followed  by  the  symbol  S 
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and  a  number  which  represents  the  value  of  the  storage  cost  function 
for  the  given  solution.  The  values  of  0«. .  .  0^  are  the  same  as  those 
indicated  for  the  0-family,  of  course.  The  second  line  of  the  pair 
contains  sixteen  digits,  each  of  which  represents  the  method  chosen 
for  one  of  the  sixteen  primitive  operations,  where  a  value  o?  zero 
indicates  that  the  corresponding  operation  is  not  under  consideration. 

Following  the  last  pair  of  solution  lines  is  a  line  containing  the 
entries  N  and  R.  The  number  associated  with  N  indicates  the  number 
of  feasible  solutions  contained  in  the  0-family  .jid  the  number  associa¬ 
ted  with  R  indicates  the  number  of  those  solutions  rejected  because 
they  violate  the  given  storage  constraint. 

Finally,  appearing  at  the  very  end  of  the  solution  listing  are  a 
number  of  summary  items.  The  number  of  the  ^-family  containing 
the  overall  optimal  solution  is  given  here,  along  with  the  total  number 
of  feasible  solutions  considered  in  the  solution  process,  (i  „e  that 
if  two  or  more  ^-families  contain  solutions  which  are  considered  op¬ 
timal,  the  number  of  the  0-family  given  as  co'-Aaining  the  optimal 
solution(s)  is  that  of  the  last  one  encountered  in  the  sequence  of 
0 -families. )  Also  given  in  this  summary  are  the  average  and  the 
maximum  values  of  .he  time  cost  function  over  ail  feasible  solutions. 

* 

The  actual  value  of  the  storage  cost  function  is  0.  5  units  less  than  the 
value  shown  here.  0.  5  is  added  to  the  actual  value  to  effect  round-off 
when  this  value  is  printed  as  an  integer. 


**********  SOLUTION  BEGINS  ********** 


**PF(  1):  OOOOOOOOCO 

pnoT.  n  P7PQIIR9F  OX  rpii.  t  OHO 

0000000000  COOIOIOOOO  llioioill!  110000111  S:  0.30686500E  05 
0010001000010100 
N:  64  Pj  0 

**PF (  2>:  0000000010 

COST:  0.31865479E  03  CPU:  7.994 

0000000010  0001000010  1110101100  110000100  S:  0. 1 5544500F  05 
00100010000 10 100 
N:  64  P:  0 

**PF(  3):  0000000100 

rncr.  n  ?q7hii77r  nx  rpn«  ii  onn 

0000000100  0100000100  101111 1010  001101000  S:  0.32042500E  05 
0010001000010100 

0000000100  0001000100  1110111010  110001000  S:  0.32042500E  05 
001  COO  100001 0100 
N:  64  P:  0 

**PF(  4):  0000001000 

COST:  0.2574 1382E  03  CPU:  17.653 

0000001000  0000001100  1011110011  001100001  S:  0.33398500E  05 
0010001000010100 

00000010000000001100  1110110011  110000001  S:  0.33398500E  05 

0010001000010100 

N:  96  R:  0 

**PF(  5):  000C001010 

COST:  0.3  188 1 689E  03  CPU:  21.508 

0000001010  0000001010  1011110000  001100000  S:  0 . 1 8256500E  05 
0010001000010100 

OOOCOOIOIO  0000001010  1110110000  110000000  S:  0, 18256500E  05 
0010001000010100 
N:  64  R:  0 

**PF(  6):  0000001100 

COST:  0.30840747E  03  CPU:  25.470 

0000001100  0000001100  1011110010  001100000  S:  0.34754500E  05 
0010001000010100 

00000011000000001100  1110110010  110000000  S:  0.34754500E  05 
0010001000010100 
N:  54  P:  0 
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**PF(  7) f  0000010000 

COST:  0.26728027E  03  CPU:  31.347 

0000010000  0100010100  1011101011  001100001  S:  0.34754500E  05 
0010001000010100 

0000010000  0001010100  1110101011  110000001  S:  0.34754500E  05 
0010001000010100 
N:  96  R:  0 

**PF(  g):  0000010010 

COST:  0.328683  HE  03  CPU:  35.200 

0000010010  0100010010  1011101000  001100000  S:  0.19612500E  05 
0010001000010100 

0000010010  0001010010  1110101000  110000000  S:  0. 19612500E  05 
0010001000010100 
N:  64  R :  0 

**PF(  9):  0000010100 

COST:  0.32350000E  03  CPU:  39.066 

0C00010100  0100010100  lOlllOlOlO  OOHOOOOO  S:  0.361  10500E  05 
0010001000010100 

0C00010100  0001010100  1  1 10101010  110000000  S:  0.361 10500E  05 

0010001000010100 

N:  64  P:  o 

**PF( 10):  0000100000 

COST:  0.25296194E  03  CPU:  46.690 

0000100000  0000110000  1011001111  0010001 1 1  S:  0.33398500E  05 
0010001000010100 

0000 1 OGOOO  0000100100  101 101  1011  001001001  S:  0.33398500E  05 
0010001000010100 

0000100000  0000110000  ! 1 1000 111  1  1100001  1  1  S:  0.33398500E  05 
0010001000010100 

0000100000  0000100100  1110011011  110001001  s:  0.33398500E  05 
0010001000010100 
N:  128  R:  0 

**PF(  11):  0000100010 

COST:  0 .3  1 436475E  03  CPU:  52.439 

0000100010  0000100010  1011001100  001000100  S:  0. 18256500E  05 
0010001000010100 

0000100010  0000100010  ; 0 1 1 0 1 1 000  OOIOOIOOO  S:  0. 18256500E  05 
0010001000010100 

C000100010  0000100010  1110001100  1  1 0000 ! 00  S:  0. 1 8256500E  05 
0010001000010100 

0000100010  0000100010  1110011000  110001000  S:  0. 18256500E  05 
0010001000010100 
N:  96  P:  0 


**PF( 12) :  0000100100 


COST!  0.309 17554E  03  CPU:  58.208 

0000100100  0000100100  1011011010  001001000  S!  0.34754500E  05 
0010001000010100 

0000100100  000C100100  1110011010  110001000  s:  0 .3 4754500E  05 
0010001000010100 
N!  96  P:  0 

**PF(13):  0000101000 

COST:  0.2688 1396E  03  CPU:  63  .894 

0000101000  0000101100  1011010011  001000001  S:  0.37.‘1665G0E  05 
0010001000010100 

0000101000  0000101100  1110010011  110000001  S:  0.37466500E  05 
0010001000010100 
N:  96  R:  0 

**PF( 14):  0000101010 

COST:  0-.33021655E  03  CPU:  67.711 

0000101010  0000101010  101  1010000  001000000  t> :  0.22324500E  05 
0010001000010100 

0000101010  0000101010  1110010000  110000000  S:  0.22324500E  05 
0010001000010100 
n:  64  R:  o 

**PF( 15):  0000101100 

COST:  0 .3 1980786E  03  CPU:  71.573 

0000101100  0000101100  1011010010  001000000  S:  0.38822500E  05 
0010001000010100 

0000101100  0000101100  1110010010  110000000  S:  0.38822500E  05 
0010001000010100 
N:  64  R:  0 

**PF( 16):  0000110000 

COST:  0.27864404E  03  CPU:  77.432 

0000110000  0000110100  1011001011  001000001  S:  0.37466500E  05 
0010001000010100 

0000110000  0000110100  1 1 K0CU101 1  110000001  S:  0.37466500E  05 

0010001000010100 

N:  96  P:  0 

**PF( 1 7):  0000110010 

COST:  0.34004712E  03  CPU:  81.256 

0000110010  0000110010  1011001000  001000000  S:  0.22324500E  05 
OOIOOOIOOOC’OIOO 

0000110010  0000110010  1110001000  110000000  S:  0.22324500F  05 
0010001000010100 
N:  6*  P:  0 
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**PF( 18):  0000110100 


COST:  0.33486401 E  03  CPU:  85.122 

0000110100  0000110100  1011001010  0010CG000  S:  0.38822500E  C5 
0010001000010100 

0000110100  0000110100  1110001010  110000000  S:  0.38822500E  05 

0010001000010100 

N:  64  R:  0 

**PF< 19):  0001000000 

COST:  0.25642383E  03  CPU:  96.713 

0001000000  0001010000  1130101111  ilOOOOll!  S:  0.32494500E  05 
0010001000010100 

000J0C0000  0001000100  1110111011  110001001  S:  0.32494500E  05 

0010001000010100 

N:  192  R:-  0 

**P.F (20) :  0001000010 

COST:  0.3 1 782666E  03  CPU:  105. 24S 

0001000010  0001000010  1110101100  110000100  S:  0.1 735250CE  05 
0010001000010100 

0001000010  0001000010  1110111000  110001000  S:  0. i 7352500E  05 
0010001000010100 
N:  144  R :  0 

**PF(21>:  0001000100 

COST:  0.3  1264380E  03  CPU:  113.689 

0001000100  0001000100  1110111010  110001000  S:  Q.3385G50QE  05 
00 10001 000010100 
N:  144  'Hi  0 

**PF(22):  0001001000 

COST:  0.272281  74E  03  CPU:  122.251 

0001001000  0001001  ICO  1110110011  110000001  S:  0.3656250CE  05 

0010001000010100 

N:  144  R:  0 

**PF(23  >:  0001001010 

COST:  C.33368457E  03  CPU:  127.802 

0001001010  0001001010  11101  10000  l 10000000  S:  0.21 420500E  05 
0010001000010100 
N:  96  R:  0 

**PF(24):  0001001100 

COST:  0.32327588E  03  CPU:  133  -406 

00010011000001001100  1110110010  llOUOOOOO  S:  0.37918500E  05 

0010001000010100 

N:  96  R:  0 


S99 


**PF<25):  ooomioooo 


COST*  0.282112C6E  03  CPU*  141.994 

0001010000  0001010100  1 1 lOlOiOl I  110000001  Si  0. 36562 50GE  05 
0010001000010100 
N*  144  Rl  0 

**PF(26>:  0001010010 

COST:  G.34351489E  03  CPU*  147.505 

000101C010  0001010010  1110101000  110000000  S*  0.21420500E  05 
0010001000010100 
N*  96  R;  0 

**PF(27):  0001010100 

COST*  0.33833203E  03  CPU*  153.197 

0001010100  0001010100  1110101010  110000000  Ss  0.379 18500E  05 
0010001000010100 
N:  96  R:  0 

**PF<28):  0010000000 

COST:  0.25296194E  03  CPU*  159.030 

0010000000  0010010000  1001 1011 11  000100111  Si  0.33398500E  05 
301000100001C100 

0010000000  0010000100  1001 £11011  000101001  St  0.33398500E  05 

0010001000010100 

N*  96  R:  0 

**PF(29)I  0010000010 

COST:  0.31436475E  03  CPU:  163.814 

0010000010  0010000010  1001101100  000100100  S:  0. 18256500E  05 
0010001000010100 

0010000010  0010000010  100 1 1 1  1 000  000101000  S:  0.18256500E  05 

0010001000010100 

N*  80  hi  0 

**PF<30):  0010000100 

COST:  0.30398584E  03  CPU*  168.564 

0010000100  00110C0100  1100111010  100001000  S:  0.33398500E  05 
0010001000010100 
r*  80  R:  c 

**PF(3 1 ):  0010001000 

COST:  0.26358789E  03  CPU*  174.214 

0010001000  0010001100  1100110011  100000C01  S*  0.34754500E  05 
0010001000010100 
N*  96  R*  0 
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**PF<32):  0010001010 


COST:  0.32499072E  03  CPU:  177.958 

0010001010  0010001010  1100110000  100000000  S:  0. 19612500E  05 
0010001000010100 
Nl  64  R:  0 

**PF(33>:  0010001100 

COST:  0.31458154E  03  CPU:  181.806 

0010001100  0010001100  1100110010  100000000  S:  0.361 10500E  05 

ooioooiorooioioo 

N:  64  h :  0 

**PF(34):  0010010000 

COST:  0.27345410E  03  CPU:  187.486 

0010010000  0011010100  1100101011  100000001  S:  0.361 10500E  05 

0010001000010100 

N:  96  R:  0 

**PF(35):  0010010010 

COST:  0.334856S3E  C3  CPU:  191.280 

0010010010  0011010010  1100101000  100000000  S:  0.20968500E  05 
0010001000010100 
N:  64  R:  0 

**PF(36):  0010010100 

COST:  0.32967383E  03  CPU:  195.043 

0010010100  00  1 10  1 0 iOO  1100101010  100000000  S:  0.37466500E  05 

0010001000010100 

N:  64  R:  0 

**PF(37):  0010100000 

COST:  0.25913574E  03  CPU:  200.745 

00101000000010110000  1 1 0000 1111  100000111  S:  0.34754500E  05 
0010001000010100 

0010100000  001C100100  1100011011  100C01001  S:  0.34754500E  05 
0010001000010100 
N:  96  R:  0 

**PF(38):  0010100010 

COST:  0.32053857E  03  CPU:  204.995 

0010100010  0010100010  1100001100  100000100  S:  0,196! 2500E  05 
0010001000010100 

0010100010  0010100010  1100011000  100001000  S:  0.1961 2500E  05 

ooicooiooooio ioo 

N:  72  P:  0 
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rpo*  yjHwt#  ’**«**!<►«• 


**PF<39):  0010100100 
COST:  0.31534961E  06  CPU:  209.299 

0010100100  0010100100  1100011010  100001000  S:  0.361 10500E  05 
0010001000010100 
N:  72  R :  0 

**PF<40):  0010101000 

POST*  0.?74<5R77QF  OX  PPM.  P  1  X  57X 

0010101000  0010101  100  1100010011*  100000001  S:  0.388225C0E  05 

0010001000010100 

Ns  72  R:  0 

**PF(41):  0010101010 

COST:  0 .33629062E  03  CPU:  216.410 

0010101010  0010101010  1100010000  100000000  S:  0.23680500E  05 
0010001000010100 
N:  48  R*  0 

**PF(42):  0010101100 

CCS?*  0.32598 169E  03  CPU:  219.277 

0010101100  0010101  100  1 1 000 10010  100000000  S:  0,401 78500E  05 

ooi ooo ioooo io ion 

N:  48  R:  0 

**PFC43>:  0010110000 

COST:  0.28481787E  03  CPU;  223.520 

0010110000  00 » Oi  10100  1100001011  100000001  S:  0.38822500E  05 
0010001000010100 
NS  72  R:  0 


**PF<44):  0010110010 


COST:  0.34622095E  03  CPU:  220.373 

00101 '0010  00101  IOC  10  1100001000  100000000  S:  0.23680500E  05 
0010001000010100 
N:  48  R:  0 

**PF(45) :  0010110100 

COST:  0.34103809E  03  CPU:  229.201 

0010110100  0010110100  1100001010  100000000  S:  0.401 78500E  05 

0010001000010100 

N:  48  R:  0 


402 


**PF(46):  0011000000 

COST:  0.26059S51E  03  CPU:  236.847 

0011000000  001 1010000  1100101111  100000111  S:  0.33850500E  05 
0010001000010100 

0011000000  0011000100  1100111011  100001001  S:  0.33850500E  05 
0010001000010100 
Nt  128  R:  0 

**PF(47):  0011C00010 

COST:  0.32200244E  03  CPU:  242.586 

0011000010  0011000010  1 10010 1 100  100000100  S:  0.18708500E  05 
0010001000010100 

0011000010  0011000010  1100111000  100001000  S:  0.18708500E  05 

0010001000010100 

N:  96  Rf  0 

**PF(48):  0011000100 

COST:  0.31S81982E  03  CPU:  248.252 

0011000100  0011000100  1100111010  100001000  S:  0.35206500E  05 
0010001000010100 
N:  96  R:  0 

**PF(49):  001 1001G0O 

COST:  0.27645776E  03  CPU:  253.958 

0011001000  0011001100  1100110011  100000001  S:  0.37918500E  05 

0010001000010100 

N:  96  R:  0 

**PF<50):  0011001010 

COST:  0.33786060E  03  CPU:  257.620 

0011001010  0011001010  1100110000  100000000  S:  0.22776500E  05 

0010001000010100 

N:  64  R:  0 

**PF(5i ):  001 1001100 

COST:  0 .32745 1 66E  03  CPU:  261.398 

0011001100  0011001100  1100110010  100000000  S:  0.39274500E  05 
0010001000010100 
N:  64  R:  0 


**PF( 52 )!  00 1  10 10000 


COST:  0.28628809E  03  CPU:  267.116 

0011010000  001 1010100  1100101011  100000001  S:  0.37918500E  05 

00100010C0010100 

N:  96  R:  0 
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**PF'53 ):  0011010010 


COST:  0.347691 16E  03  CPU:  270.921 

0011010010  0011010010  1100101000  100000000  S:  0.22776500E  05 

OUlOOOlOOOOiOlOO 

N:  64  R:  0 

**PF(54):  001  1 0 1 0 IOC 

COST:  C  .34250806E  03  CPU:  274.643 

001101C100  0011010100  1100101010  100000000  S:  0.39274500E  05 
001000100 jOIOIOO 
N:  64  R:  0 

**PF(55):  0100000000 

COST:  0.25642383E  03  CPU:  284.501 

0100000000  0100010000  10  1 1 10  1 1  1  1  001100111  S:  0.32494500E  05 
0010001000010100 

0100000000  0100000100  101  111  1011  001101001  S:  0 .3249 4500E  05 
0010001000010100 
NS  160  P:  0 

**FF(56):  C100000010 

COST:  0.3 1 782666E  03  CPU:  292.119 

0100000010  0100000010  1011101  ICO  OOliOOlOO  S:  0, ! 7352500E  05 
0010001000010100 

0100000010  0100000010  1011111000  001101000  S:  0J7352500E  05 

0010001000010103 

N:  128  R:  0 

**PF<57):  01000C0100 

COST:  0.30710547E  03  CPU:  299.813 

0100000100  0101000100  1010111010  000001000  S:  0.33398500E  05 

0010001000010100 

N:  128  R:  0 

**PF(58):  0100001000 

COST:  0.26670776E  03  CPU:  308  .275 

0100001000  0100001100  1010110011  000000001  S:  0.34754500E  05 

001000100001010C 

N:  144  R:  0 

**PF(5S):  0100001010 

COST:  0 ,328 1 1 060E  03  CPU:  313.845 

01000C1010  0100001010  1010110000  000000000  S:  0.19612500E  C5 

0010001000010100 

N:  96  R:  0 
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**PF<60):  0100001100 


COST:  0.3  1770166E  03  CPU:  319.521 

0100001100  0100001100  1010110010  000000000  S:  0.361 10500E  05 
00 10001 0000 10 100 
N:  96  R  :  0 

**PF(61 ):  0100010000 

COSt:  0.27657422E  03  CPU:  327.953 

0100010000  0101010100  1010101011  000000001  S:  0.361 10500E  05 
0010001000010100 
N:  144  R:  0 

**PF(62):  0100010010 

COST:  0.33797705E  03  CPU:  333  .529 

0100010010  0101010C10  1010101000  000000000  S:  0.209 68500E  05 
0010001000010100 
NJ  96  R:  0 

**PF<63):  0100010100 

COST:  0.33279395E  03  CPU:  339.118 

0100010100  0101010100  1010101010  000000000  S:  0.37466500E  05 
0010001000010100 
N:  96  R:  0 

**PF(64 ) :  0100100000 

COST:  0.26225562E  03  CPU:  346.644 

0100100000  OiOOllOOOO  1010001111  00000011  1  S:  0,3 4754500E  05 
0010001000010100 

0100100000  0100100100  1010011011  000001001  S:  0.34754500E  05 

ooiocoiooonoioo 

N:  128  R:  0 

**PF(65) :  0100100010 

COST:  0.32365845E  03  CPU:  352.244 

0100100010  010010C010  iOlOOOl 100  000000100  b:  0.19612500E  05 
0010001000010100 

0100100010  0100100010  1010011000  000001000  S:  0.19612500E  05 

0010001000010100 

N:  96  R:  0 

**PF(66):  0100100100 

COST:  0.3  1846973E  03  CPU:  357.780 

0100100100  0100100100  1010011010  000001000  S:  0.361 10500E  05 

00100010000101C0 

N:  96  R:  0 
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**PF(67) 


010010100C 


COST:  0.2781079  IE  03  CPU:  363.331 

0100101000  0100101100  1010010011  000000001  S:  0.38822500E  05 

0010001000010100 

N:  96  R:  0 

**PF(68>:  0100101010 


COST:  0.33951074E  03  CPU:  367.008 

0100101010  010C101010  1010010000  000000000  S:  0.23680500E  05 
0010001000010100 
N:  64  P:  0 

**PF(69):  0100101100 


COST:  0.329  10 132E  03  CPU:  370.736 

0100101100  0100101100  1010010010  000000000  S:  0.401 78500E  05 
0010001000010100 
N:  64  R:  0 

**PF( 70) :  0100110000 


COST:  0.28 793 799 E  03  CPU:  376.424 

0100110000  01001 10100  1010001011  OOOOOOOOl  S:  0.38822500E  05 

0010001000010100 

N:  96  P:  0 


**PF( 71):  01001  10010 


COST:  0 .34934 13  1  E  03  CPU:  380.156 

0100110010  01001 10010  1010001000  000000000  S:  0.23680500E  05 
0010001000010100 
N:  64  R:  0 

**PF<  72 ):  0100110100 


COST:  0.3441579SE  03  CPU:  383  .873 

0100110100  0100110100  1010001010  000000000  S:  0.401 78 500E  05 

0010001000010100 

N:  64  P:  o 


**PF( 73) :  0101000000 


COST:  0 ,2657 1  729E  03 
0101000000  0101010000 
0010001 jOOOIOI 00 
0101000000  0101000100 
0010001000010100 
N:  128  P:  0 


CPU:  391  .424 
1010101111  0000001 1 1 

1010111011  000001001 


S:  0.3385C500F  05 
S:  0.33850500E  05 


408 


**PF(74):  0101000010 


COST:  0.32712061E  03  CPU:  397.076 

0101000010  0101000010  1010101100  000000100  S:  0.18708500E  05 
0010001000010100 

0101000010  010 10000 10  10 10 1 1 1000  000001000  S:  0.18708500E  05 
0010001000010100 
N:  96  R:  0 

**PF(75):  0101000100 

COST:  C.52193750E  03  CPU:  402.683 

0101000100  0101000100  1010111010  000001000  S:  0.35206500E  05 
0010001000010100 
Nr  96  R:  0 

**PF(7C>:  0101001000 

COST:  0.281 57593E  03  CPU:  408.477 

0101001000  0101001100  1010110011  000000001  S:  0.37918500E  05 
0010001000010100 
N:  96  R:  0 

**PF( 77) :  0 10 100 1 0 i 0 

COST:  0,34297852E  03  CPU:  412.182 

OlOlOOJCiO  0101001010  1010110000  000000000  S:  0.22776500E  05 

0010001000010100 

N:  64  P:  0 

**PF(78):  0101001100 

COST:  0.33256958E  03  CPU:  415.920 

0101001100  0101001100  1010110010  000000000  S:  0.39274500E  05 
0010001000010100 
N:  64  R:  0 

**PF(79>:  0101010000 

r*  pic  T  •  n  p  p  I  j  •  4  P I  S  p 

0101010000  0101010100  10 10 10 10 1 1  000000001  at  0.379 18500E  05 
0010001000010100 
N:  96  R:  0 


**PFf80):  0101010010 


COST:  0.35280908E  03  CPU:  425.174 

OlOlOiOO 10  0101010010  1010101000  000000000  S:  0.227765U0E  0? 

001000100001C100 

N:  64  R:  0 


407 


**PF(81 ):  0101010100 


COST:  0.34762622E  03  CPU:  428.804 

0101010100  0 i 0 i 0 1 0 100  1010101010  000000000  S:  0.39274500E  05 

0010001000010100 

N:  64  R:  0 

PFC28)  CONTAINS  PEST  SOLUTIONCS).  7232  SOLUTIONS  CONSIDERED. 

PEA N  COST  OVER  ALL  SOLUTIONS:  0.35064868E  03 

WAV  I  PUP  COST  ENCOUNTERED:  0.57534351E  03 

**********  SOLUTION  TERMINATED  ********** 


Appendix  F 


SOLUTION  LISTING  FOR  CASE  2 
OF  THE  MEDICAL  DIAGNOSIS  PROBLEM 


**********  SOLUTION  BEGINS  ********** 

**PF(  1):  0000000000 

COST:  0.19451 155E  03  CPU:  4.286 

0000000000  0001000000  1110101111  110000111  S:  0 . 1 8482500E  05 
0010001000010100 
N:  64  P:  0 

**PF(  2):  0000000010 

.-'ncT.  n  PXSPIQIOF  03  p  p  1 1  •  a  405 

0000000010  0001000010  1110101101  110000100  S:  0. 19838500E  05 
0010001000010100 
N:  64  P:  0 

**PFl  3):  0000000100 

COST:  0.22721243E  03  CPU:  12.602 

0000000100  010000C110  1011111001  001101000  S:  0.26618500E  05 
0010002000010100 

0000000100  0001000110  1110111001  110001000  S:  0.2661 8500E  05 
0010002000010100 
N:  64  R:  C 

**PF (  4):  0000001000 

COST:  0 . 19406752E  03  CPU:  18.755 

0000001000  0000001000  1011110011  001100001  S:  0.22098500E  05 
0010001000010100 

0000001000  0000C01000  111011001!  110000001  S:  0.22098500E  05 

001 000 10000  in  100 

N:  96  P:  0 
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**PF(  5):  0000001010 


COST:  0.2347751  OE  03  CPU:  22.885 

0000001010  0000001010  1011110001  001100000  S:  0.23454f00E  05 
0010001000010100 

OCOOOOIOIO  0000001010  1110110001  110000000  S:  0.23454500E  05 
0010001000010100 
N:  64  R:  0 


**PF(  6):  0000001100 

COST;  0.24145790E  03  CPU:  27.060 

000000!  100  0000001100  101  I  1100(1  001100001  S:  0.34754500E  05 
0010001000010100 

0000001100  0000001100  1110110011  110000001  S:  0.34754500E  05 
0010001000010100 
lit  64  RC  0 


**PF(  7):  0000010000 

COST:  0 .2073451 5E  03  CPU:  33,207 

C000010000  0100010000  1011101011  001 10^001  S:  0.23906JD0E  05 
0010001000010100 

0000010000  0001010^00  1110101011  i  1000.1001  S:  0.23906500E  05 
00  1 000 IOC  'JO  10100 
N:  96  R:  0 

**PF(  8)!  0000010010 


COST:  0.24805269E  03  CPU:  37.311 

0000010010  0100010010  1011101001  001100000  S:  0.25262500E  05 
0010001000010100 

0000010010  0001010010  1110101001  110000000  S:  0. 25262 500E  05 
0010001000010100 
Nt  64  R:  0 


**PF(  9):  0000010100 


COST:  0.261 57520E  03  CPU:  41.416 

0000010100  0100010100  1011101011  001100001  S:  0.36562500E  05 
0010001000010100 

0000010100  000 1 0 1 0 1 00  1110101011  1 1 000000 1  S:  0.36562500E  05 
0010001000010100 
N:  64  Pi  0 
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**PF( iO):  OCOOIOOOOC 

COST:  0.18838756E  03  CPU:  49.613 

0000100000  0000100000  1011001111  001000111  S:  0.22098500E  05 
0010001000010100 

0000100000  0000100000  1011011011  001001001  S:  0.22098500E  05 
0010001000010100 

00001000000000100000  1110001111  110000111  S:  0.22098500E  05 
0010001000010100 

0000100000  OOOOICOOOO  1110011011  110001001  Si  0.22098500E  05 

0010001000010100 

N:  128  P:  0 

**PF( 11):  0000100010 

rn^T*  fl  ppii*  5S  744 

0000*100010  0000 1 000  10  10  I  1 00  1 1 0!  001000100  S:  0.23454500E  05 
0010001000010100 

0000100010  0000100010  1011011001  001001000  s:  0.23454500E  05 
0010001000010100 

CC00100010  0000100010  1110001101  110000100  S:  0.23454500E  05 
0010001000010100 

0000100010  0000100010  1110011001  110001000  S:  0.23454500E  05 
0010001000010100 
N:  96  R:  0 

**PF( 12) :  0000100100 

COST?  0.2426099 1 E  03  CPU:  61.852 

0000100100  0000100100  1011011011  001001001  S:  0.34754500E  05 
0010001000010100 

00001C0100  0000100100  1110011011  110001001  S:  0,3 4754500E  05 

00100010000J0I00 

N:  96  R;  0 

**PF( 13):  0000101000 

COST:  0.20901950E  03  CPU:  67.914 

0000101000  0000101000  1011010011  001000001  S:  0.27522500E  05 
0010001000010100 

0000101000  0000101000  II 10010011  110000001  St  0.27522500E  05 
0010001000010100 
N:  96  R:  0 

**PF<14):  0000101010 

COST:  0.2497271 1 E  03  CFJ:  71.952 

0000101010  0000101010  1011010001  001000000  S:  0.28878500E  05 
0010001000010100 

00001010100000101010  1110010001  110000000  S:  0.28878500E  05 
0010001000010100 
N:  64  R:  0 
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**PF( 15):  0000101100 


COST:  0.25640967E  03  CPU:  76,022 

0000101100  0000101100  1011010011  001000001  S:  0,401 78500E  05 
0010001000010100 

0000101100  OOOOlCilOO  1110010011  110000001  S:  0.401 785C0E  05 
0010001000010100 
N:  64  P;  C 

**PF( 16):  0000110000 

COST:  0.22228516E  03  CPU:  82.168 

C0001 10000  0000110000  1031001011  001000001  S:  0.27522500E  05 
0010001000010100 

0000110000  0000110000  1110001011  110000001  S:  0.27522500E  05 
0010001000010100 
N:  96  R:  0 

**PF(17):  0000110010 

COST:  0.26299243E  03  CPU:  86.235 

0000110010  0000110010  1011001001  001000000  S:  0.2S878500E  05 
0010001000010100 

0000110010  0000110010  1110001001  110000000  s:  0.28878500E  05 
0010001000010100 
N:  64  R:  0 

**PF(18):  0000110100 

pftCT*  n  rpn*  on 

ooooi 10100  0000110100  1011001011  OOlOOOOOl  S:  0.401 78500E  05 
0010001000010100 

00001101000000110100  1110001011  110000001  S:  0.40178500E  05 
0010001000010100 
N:  64  P:  0 

**PF( 19);  0001U00000 

COST:  0.19 1 883 5 6 E  03  CPU:  102. *76 

0001000000  0001000000  1110101111  110000111  S;  0 .207 42500E  05 
OOlOOOlOOOOIOlOO 

0001000000  0001000000  1110111011  110001001  S:  0.20742500E  05 

OOlOOOlOOOOIOlOO 

N:  192  R:  0 

**PF(20) :  0001000010 

COST:  0.23259109E  03  CPU:  111.484 

0001000010  0001000010  1110101101  IICOOOIOO  S:  0, 22O98E|O0F  05 
OOlOOOlOOOOIOlOO 

0001000010  00010C0010  lliomooi  IIOOOIOOO  S:  0.220985CCE  C5 

OOlOOOlOOOOIOlOO 

N:  144  R:  0 


**PF(21 ):  000100010C 


rncT*  o  f  nx  rpn*  ipn  an7 

0001000100  0001000110  1 1 101 1 1001  110001000  S:  0.2887C500E  05 

0010002000010100 

N:  144  R:  0 

**PF(22):  0001001000 

COST:  0.2 1252350E  03  CPU:  129.336 

0001001000  0C01001000  1110110011  110000001  S:  0.26166500E  05 

0010001000010100 

N:  144  R:  0 

**PF(23):  0001001010 

COST:  0.25323 1 09E  03  CPU:  135.231 

0001C01010  0001001010  1110110001  110000000  S:  0.27522500E  05 
0010001000010100 
N:  96  R:  0 

**PF(24):  0001001100 

COST:  0.25991357E  03  CPU:  141.192 

0001001100  0001001100  1110110011  110000001  S:  0.38822500E  05 

0010001000010100 

Nt  96  R:  0 

**PF<25) :  0001010000 

COST:  0.2257891  IE  03  CPU:  150.184 

00010100000001010000  1110101011  110000001  S:  0.26166500E  05 
0010001000010100 
N:  144  R:  0 

**PF(26):  0001010010 

COST:  0.26649634E  03  CPU:  156.101 

0001010010  0001010010  1110101001  110000000  S:  0.2752250CE  05 
0010001000010100 
N:  96  R:  0 

**PF(27) :  00010*0100 

COST:  0.27023193E  03  CPU:  161.993 

0001010100  OOC'OIOIIO  1110101001  110000000  S:  0.34302500E  05 

0010002000010100 

N:  96  P:  0 


413 


**PF(28>:  0010000000 


COST:  0.18838756E  03  CPU;  168.150 

0010000000  001C00C000  1001101111  000100111  S:  0.22098500E  05 
0010001000010100 

0010000000  0010000000  1001 1 11011  000101001  S;  0.22098500E  05 

00100010000I010C 

N:  96  R:  0 

**PF(29)S  0010000010 


COST;  0.22909509E  03  CPUs  173. 214 

0010000010  0010000010  1001101101  000100100  Ss  0.23454500E  05 
0010001000010100 

0010000010  0010000010  1001111001  000101000  S:  0.23454500E  05 

ooioroiooooioioo 

N:  80  R:  0 

**PF(30):  0010000100 


COST*  0.23378389E  03  CPU:  178.211 

0010000100  0011000100  1100111011  100001001  Ss  0.324Q4500E  05 

0010001000010100 

Ns  80  Rs  0 

**PF(3  1 ):  ooioomooo 

COST;  0.200181 50E  03  CPUs  184.190 

0010001000  0010001000  1100110011  100000001  Ss  0.23454500E  05 
0010001000010100 
Ns  96  Rs  0 

**PF(32)S  0010001010 


COST*  0.2408891  IE  03  CPU:  188.149 

0010001010  0010001010  1100110001  100000000  Ss  0.24810500E  05 

0010001000010100 

NS  64  Rs  0 

**PF(33 ):  0010001100 


COST*  0.24757190E  03  CPUs  192.159 

00100011000010001100  1100I10C11  100000001  Ss  0.361 1 0500E  05 
0010001000010100 
Ns  64  Rs  0 


**PF(34>:  0010010000 
COST:  0 ,2  13459  1  7E  03  CPUs  198.159 

0010010000  001 1C10000  1100101011  10C000001  Ss  0.25262500F  05 
0010001000010100 
N:  96  Rs  0 
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■r  «.v*»**  v* 


**PF(35):  0010010010 

COST:  0.25416669E  03  CPU:  202.115 

0010010010  001  1010010  1100101001  100000000  S:  0.2661B500E  05 

0010001000010100 

N:  64  R:  0 

**PF(36):  0010010100 

COST:  0.26768921 E  03  CPU:  206.067 

0010010100  0011010100  1100101011  100000001  S:  0.37918500E  05 

0010001000010100 

N:  64  R:  0 

**PF(o7):  0010100000 

COST:  0* 19450 1 56E  03  CPU:  212.098 

0010100000  0010100000  110C0011U  100000111  S:  0.23454500E  05 
0010001000010100 

0010100000  0010100000  1100011011  100001001  S:  0.23454500E  05 
0010001000010100 
N:  96  R:  0 

**PF(38):  0010100010 

COST:  0.235209 12E  03  CPU:  216.600 

0010100010  0010100010  1100001101  lOOOOOlOO  S:  0.24810500E  05 
0010001000010100 

0010100010  0010100010  1100011001  100001000  S:  0.24810500E  05 
0010001000010100 
N:  72  R:  0 

**PF(39):  0010100100 

COST:  0.24872389E  03  CPU:  221  .079 

0010100100  0010100100  1100011011  100001001  S:  0,36! 10500E  05 
0010001000010100 
N:  72  R:  0 

**PF(40) :  0010101000 

COST:  0.2 1513350E  03  CPU:  225.537 

0010101000  0010101000  1 1 000 i 001 1  100000001  S:  0.28878500E  05 
0010001000010100 
N:  72  R:  0 


**PF<41>:  0010101010 


COST:  0.25584109E  03  CPU:  228  .502 
0010101010  0010101010  1 1 000 1 000 1  100000000 


S:  0.30234500E  05 
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**PF(42):  0010101100 


COST:  0.26252344E  03  CPU:  231.499 

0010101100  0010101100  1 1 000 i 00 1 1  10C000001  S:  0.4153450CE  05 
0010001000010100 
N:  4P  P:  0 

**PF(43):  0010110000 

COST:  0.2283991  IE  03  CPU:  236.030 

0010110000  0010110000  1100001011  1000000C1  S:  0.28878500F  05 
0010001000010100 
N:  72  P:  0 

**PF(44) :  0010110010 

COST:  0 .269  1 0645E  03  CPU:  239.021 

0010110010  0010110010  1100001001  100000000  S:  0.30234500E  05 
0010001000010100 
N:  48  P:  0 

**PF(45) :  00101 10100 

COST:  0 ,282629  1  5E  03  CPU:  242.006 

0010110100  0010110100  1100001011  100000001  S:  0,41 534500E  05 
0010001000010100 
N:  48  P:  0 

**PF(46) :  001 1000000 

COST:  0. 19599956F  03  CPU:  250.154 

0011000000  0011000000  1100101111  lOOOOOlll  S:  0.220S>8500E  05 
0010001000010100 

0011000000  0011000000  1100111011  100001C01  S:  0.22098500E  05 
0010001000010100 
N:  128  P:  0 

**PF(47):  0011000010 

COST:  0  .236707  1  1  E  03  CPU:  256.192 

0011000010  0011000010  1100101101  100C00100  S:  0.234545C0E  05 
0010001000010100 

0011000010  0011000010  1100111001  100001000  S:  C .23 454500E  05 

0010001000010100 

N:  96  P:  0 

**PF(48):  001 1000 1 00 

COST:  :.2502r  ^9E  03  CPU:  262.159 

0011000100  0  11000100  1100111011  100001001  S:  0.34754500E  05 
00 10001 0000  100 
N:  96  F:  0. 
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**PF<49):  001  10010C0 


COS Ti  0,2  1 6639 5 1 E  03  CPU:  268.124 

0011001000  0011001000  1100110011  100000001  S:  0, 27522 500E  05 

0010001000010100 

fit  96  P:  0 

**PF(50>:  0011001010 

COST:  0.25734668E  03  CPU:  272.073 

00110010100011001010  1100110001  100000000  S:  0.28878500E  05 

0010001000010100 

N:  64  P :  0 


**P F( 5 1 ) :  001 1001 100 

COST:  0.26402954F  03  CPU:  276.090 

00110011000011001100  1100110011  100000001  S:  0.40178500E  05 
O'0 10C0  10000 10  100 
N:  64  R:  C 

**PF(52):  0011010000 

COST:  0.2299051  IE  03  CPU:  282.135 

0011C10000  OOllOiOOOO  1100101011  100000001  S:  0.27522500E  05 
0010001000010100 
N:  96  R:  0 


**PF(53):  0011010010 

COST:  0.27061230E  03  CPU:  286.106 

0011010010  001 1 C 1 00 1 0  1100101001  100000000  S:  0.288785C0E  05 
0010001000010100 
N :  64  R:  0 

**PF(54):  001 1 0 1 0 1 OC 


COST:  0.284I3525E  03  CPU:  290.089 

00110101000011010100  1100101011  100000001  S:  0.40178500E  05 

0010001000010100 

N:  64  R :  0 

**PF(55):  0100000000 

COST:  0.191 88356E  03  CPU:  300.349 

0100000000  0100000000  1011101111  001100111  S:  0.20742500E  05 
CO  1 000 1 0000 10100 

0100000000  0100000000  1011111011  001101001  S:  0.20742500E  05 

0010001000010100 

N:  160  R:  0 
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**PF<56>:  0100000010 


COSTS  0.23259109E  03  CPU:  308.444 

0100000010  0100000010  1011101101  001100100  S:  0.220985L0E  05 
0010001000010100 

0100000010  0100000010  1011111001  001101000  S:  0.22098500E  05 

0010001000010100 

N:  128  R:  0 

**PF<57):  0100000100 

COST:  0.22953642E  03  CPU:  516.396 

01C00C0100  0100000110  1 0 l 1 1 1 1 00 1  001101000  S:  0.288785C0E  05 
0010002000010100 
N:  128  R:  0 

**PF(58) :  0100001000 

COST:  0 .20333 75 1 E  03  CPU:  325.395 

0100001000  0100001000  1010110011  C00000CC1  S:  0.23434500E  05 
0010001000010100 
N:  i44  P:  0 

**PF(59)S  0100001010 

COST:  0 .244045  j  OE  03  CPU:  33  1  .347 

0100001010  0100001010  1010*  5001  000000000  S:  0,248 10500E  05 
0010001000010100 
N:  96  R:  0 

**PF(60) :  0100001100 

COST:  0.25072789E  03  CPU:  337.344 

0100001100  0100001100  1010110011  00C0000C1  S:  0.361 1 0500E  05 

0010001000010100 

N:  96  P:  0 

**PF( 61 ) :  C100C10000 

COST:  0.2 1661516E  03  CPU:  346,152 

0100010000  01010 100C0  1010101011  000000001  S:  0.25262500E  05 
0010001000010100 
N:  144  P:  0 

**PF(62):  0100010010 

COST:  0.25732227E  03  CPU:  351.952 

010C010010  0101010010  1010101001  OOCOOOOOO  S:  0,2661 8 500E  05 

C01000100C01010C 

MS  96  P:  0 
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.' »>**«* 


**PF(63 ) :  0100010100 

port*  n  P7np^iQ^r  ppm.  xs7  rqq 

oloooicioo  0100010110  lonioiooi  oonooooo  s:  0.34302500E  05 
0010002000010100 
N:  96  Fs  0 

**PF(64):  0100100000 

COST:  0.19765756E  03  CPU:  366,142 

0100100000  0100100000  1010001111  000000111  S:  0.23454500E  05 
0010001000010100 

C1001000CQ  0100100000  1010011011  000001001  S:  0.23454500E  05 
0010001000010100 
N:  128  P:  0 

**PF<65):  0100100010 

C’OST:  0 *238365 1 1  £  03  CPU:  372.274 

0100100010  0100100010  1010001 10 1  000000100  S:  0.248 1 0500E  05 
0010001000010100 

0100100010  0100100010  101001  1001  000001000  S:  0.248 1 050CE  05 
0010001000010100 
N:  96  R:  0 

**PF(66):  0100100100 

COST:  0.25187988E  03  CPU:  378.349 

0100100100  0100100100  10 IOC  011  000001001  S:  0.361 1 0500E  05 
0010001000010100 
NS  96  R:  0 

**PF(67)i  0100101000 

COST:  0 .2  182895  IE  03  CPU:  384.432 

0100101000  0100101000  1010010011  000000001  S:  0.28878500E  05 
0010001000010100 
N:  96  P:  0 

**PF<68):  0100101010 

COST:  0.25899683E  03  CPU:  388.453 

0100101010  01CC101010  1010010001  000000000  S:  0.30234500E  05 

0010001000010100 

N:  64  R:  0 

**PF(69 ) :  0100101100 

COST:  0.26567969E  03  CPU:  392.515 

0100101100  0100101100  1010010011  000000001  S:  0,41 534500E  05 

0010001000010100 

n:  64  P:  0 
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**PF<70):  0100110000 


COST:  0.23 1  55510E  03  CPU:  398.665 

0100110000  0100110000  1010001011  000000001  S:  0.28878500E  05 
0010001000010100 
N:  96  R:  0 

**PF(7l ):  0100110010 

COST:  0.27226245E  03  CPU:  402.717 

0100110010  0100110010  1010001001  000000000  S:  0.30234500E  05 
0010001000010100 
N*  64  P:  0 

**PF(72):  0100110100 

COST:  0.2?578516E  03  CPU:  406.750 

0100110100  0100110100  1010001011  D00000001  S:  0 .41 534500E  05 
0010001000010100 
'N:  64  P:  0 

**PF( 73 ):  0101000000 

COST:  0.201  15356E  03  CPU:  414.999 

01C1000000  0101000000  1010101111  000000111  S:  0.2209B500E  05 
0010001000010100 

0101000000  0101000000  1010111011  000001001  S:  0.22098500E  05 

0010001000010100 

N:  128  R:  0 

**PF( 74) :  0101000010 

COST:  0.24 1861 10E  03  CPU:  421.120 

0101000010  0101000010  1010101 101  000000100  Si  0.23 4545C0E  05 
0010001000010100 

0101000010  0101000010  1010111001  000001000  S:  0.23454500E  05 

0010001000010100 

N:  96  R:  0 

**PF(75>:  0101000100 

COST:  0.2553P390E  03  CPU:  427,200 

oioioooioo  oioioocioo  loiomoii  000001001  s:  0.34754500E  05 
0010001000010100 
N:  96  P:  0 

**PF( 76):  0101001000 

COST:  0.221  793 50E  03  CPU:  433  .278 

0101001000  0101001000  1010110011  000000001  S:  0.27522500E  05 
0010001000010100 
N:  96  P:  0 
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**PF(77):  0101001010 


rrrcT.  n  rn  ppii*  4*7  pq 

0101001010  0101001010  1010*110001*  000000000  S:  0.28878500E  05 
0010001000010100 
N:  64  Ps  0 

**PF(78) :  0101001100 

COST:  0 .269  18359E  03  CPU:  441  .349 

0101001100  0101001100  1010110011  000000001  S:  0.401 7850CE  05 
0010001000010100 
N:  64  P:  0 

**PF( 79):  0101010000 

COST:  0 ,235059 10E  03  CPU:  447.451 

0101010000  0101010000  1010101011  000000001  S:  0.27522500E  05 
0010001000010100 
N:  96  P:  0 

**PF(80) :  0101010010 

COST:  0.27576636E  03  CPU:  451  .474 

0101010010  0101010010  1010101001  000000000  S:  0.28878500E  05 
0010001000010100 
N:  64  R:  0 

**PF(81 ):  0101010100 

COST:  0.28928882E  03  CPU:  455.490 

0101010100  U 10 10 10 100  1010101011  000000001  S:  0,401 78500E  05 
0010001000010100 
N:  64  P:  0 

Pk28>  CONTAINS  BEST  SOLUTION(S).  7232  SOLUTIONS  CONSIDEPED. 

MEAN  COST  OVER  ALL  SOLUTIONS:  0.28151 1 23E  03 

MAXIMUM  COST  ENCOUNTERED:  0.42425903E  03 

******* <***  SOLUTION  TERMINATED  ********** 
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Appendix  G 


SOLUTION  LISTING  FOR  CASE  3 
OF  THE  MEDICAL  DIAGNOSIS  PROBLEM 


**********  SOLUTION  BEGINS  ********** 


**BF(  1)S  0000000000 

COST:  0.27245435E  03  CPU:  4.065 

0000000000  0001010000  II 10101111  110000111  S:  0.30686500E  05 
CO  I  0001 0000 101 00 
N:  64  R:  0 

**PF (  2):  0000000010 


COST:  0.3150891  IE  03  CPU:  8.025 

0000000010  OCOIOOOOIO  1  1 10  10  1  100  110000100  S:  0.15544500E  05 

0010001030010100 

N:  64  P:  0 

**PF(  3):  CC00000100 


rn^T*  n  m  ppn*  1 1  004 

0000000100  0100000100  101  1  *1 1  10  10"  001  101000  S:  0.32042500E  05 
0010001000010100 

0000000100  0001000100  1110111010  1 1 000 1 000  S:  0.32042500E  05 
CO  1000 100001 0100 
N:  64  R:  0 


**PF<  4):  000000 1000 


COST:  0.26675464E  03  CPU:  17.696 
0000001000  0000001100  1011110011  001100001 
0010001000010100 

0000001000  0000001100  11 101  10011  110000001 

0010001000010100 

N:  96  P:  0 


S:  0.33398500E  05 
S:  0.33398500E  05 
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**PF(  5):  0000001010 


COST;  0.323 1892 IE  03  CPU:  21.550 

0000001010  0000001010  1011110000  001100000  S;  0.18256500E  05 
0010001000010100 

0000001010  0000001010  1110110000  110000000  S:  0, 182565C0E  05 

0010001000010100 

N:  64  R:  0 

**PF(  6):  0000001100 


COST;  0.31553931E  03  CPU;  25.466 

0000001100  0000001100  1011110010  001100000  S;  0.34754500E  05 
0010001000010100 

0000001100  0000001100  1110110010  110000000  S;  0.34754500E  05 

0010001000010100 

N;  64  P;  0 

**PF(  7);  0000010000 


COST;  0.2  7250854E  03  CPU:  31.468 

0000010000  0100010100  1011101011  001100001  Sf  0 .3 4754500E  05 
001C001000010100 

0000010000  0001010100  1110101011  ’10000001  S:  0.347545C0E  05 
0010001000010100 
Nt  96  P:  0 

**PF<  8):  0000010010 


COST;  0,328943 12E  03  CPU:  35,421 

0000010010  0100010010  1011101000  001100000  S:  0. 19612500E  05 
0010001000010100 

0000010010  OCO 1010010  1110101000  1100000CC  S:  0. 1S612500E  05 

001000100001C100 

N;  64  P:  0 

**PF(  9):  C000C1010C 


COST;  0.32619287E  03 
0000010100  0100010100 
0010001000010100 
0000010100  000 1 C  1 0 1 00 
C0100010C001010C 
N;  64  P;  0 


PP  !!•  XQ  T  7  A 
1011101010  001100000 

1  1  10101010  110000000 


S;  0,361 10500E  05 
S;  0.361 10500E  05 
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**PF (10);  OCOOIOOOCO 


COST:  0.262S5435E  03  CPU:  47.33C 

00001000000000110000  1011001111  001000111  S:  C.33398500E  05 
0010001000010100 

0000100000  0000100100  1011011011  001001001  S:  0.33398500E  05 
U010001000010100 

00001000000000110000  1110001111  110000111  S:  0.3339850CE  05 
0010001000010100 

0000100000  0000100100  1 1 1 00 1 1 0 1 1  1  1000  100  I  S:  0.3339850CE  05 
0010001000010100 
NS  128  R:  0 

**PF( 11):  0000100010 

COST:  0.3193891SE  03  CPU:  53  .276 

0000100010  0000100010  1011001100  001000100  S:  C.18256500E  05 
00100010CC010100 

0000100010  0000100010  101101  1000  001C01000  S:  0 . 1 8256500E  05 
0010001000010100 

0000100010  0000100010  1110001100  110000100  S:  0 . 1 82565CCE  05 
0010001000010100 

0000100010  000010001C  1110011000  110001000  S:  0 . 1 825650CE  05 

0010001000010100 

N:  S6  R:  0 

**PF( 12) :  0000100100 

COST:  0.3! 633936E  03  CPU:  59.187 

0000 100  100  0000100100  <011011010  CC 100 1 000  S:  0.3475 45C0E  05 
OOIOOOIOOCOIOIOO 

0000100100  0000100100  12S0011010  110005000  S:  0.347545C0E  05 

CQlOOOlCOOOlOlOO 

N:  96  R  5  0 

**PF( 13 )?  0000101000 

COST:  C.2F635449E  03  CPU:  65.039 

00001010C0  CC00S01J00  lOUOlOOH  OOIOOOOOI  S:  C.3746S500E  05 

00 1 000 1 ocoo ic 1 00 

0000301000  COOCIOUOO  1  1  1 OC  100 1  2  1 1  000000  i  S  5  0.3746650CE  05 
0010001000010100 
N:  96  P:  0 

**PF( 14):  OOOO’OIOIO 

COST:  0.3427P906E  03  CPU:  68.942 

0000101010  0000101010  1011010000  001000000  S:  0 ,2232 45C0E  Cc 

00  moo  sooooio  100 

0000  3  01010  0000101010  1110010000  i 10000000  S:  0.2232450CE  05 
CO  1 000 10000 10 100 
N :  64  P  *  0 
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**PF(15):  0000101100 


COST:  0 .335 13940E  03  CPU:  72.SB0 

0000101100  0000101100  1011010010  001000000  S:  0.38822500E  05 
0010001000010100 

0000101100  0000101100  1110010010  110000000  S:  0.38822500E  05 

0010001000010100 

N:  64  R:  0 

**PF(16):  0000110000 

COST:  0.29030P35E  03  CPU:  78.796 

00001100000000110100  1011001011  001000001  S:  0.37466500E  05 
0010001000010100 

0C00110000  0000110100  1110001011  110000001  S:  0.37466500E  05 
0010001000010100 
N:  96  R:'  0 

**FF (17):  0000110010 

COST:  0.34674292E  03  CPU:  82.705 

0000110010  0000110010  1911001000  001000000  S:  0.22324500E  05 
0010001000010100 

00001 1001C  C0001 10010  1110001000  110000000  S:  0.22324500E  05 
0010001000010100 
N:  64  P;  0 

**PF( 18):  OOC  ’ 10100 

COST:  0.34399292E  03  CPU:  86.640 

0000110100  0000110100  101 1001010  001000000  S:  0.38822500E  05 
0010001000010100 

0000110100  0000110100  1110001010  110000000  S:  0.38822500E  05 

00100010C001C1C0 

N:  64  R:  0 

**PF( 19 ) :  000 1C00000 

COST:  0 . 25885449 E  03  CPU:  98.333 

0001000000  0001010000  1110101111  110000111  S:  0 ,3249 4500E  05 
0010001000010100 

0001000000  0001000100  111011  1011  1  10001001  S:  0.32494500E  05 
0010001000010100 
N:  192  R:  0 

**PF(20) :  0001000010 

COST:  0 ,3 1 528906E  03  CPU:  106.930 

00C1000010  0001000010  1110101100  110000100  S:  0 . I 7352500E  05 
C010001000010100 

C0010000.10  0001000010  11  1011  1000  I  10001000  S:  0 . 1  7352500E  05 
0010001000010100 
N:  144  P:  0 


**PF(21 ):  0001000100 


COST:  0.3  125393  1 E  03  CPU?  115.413 

OOCIOOOIOO  0001000100  lllOUIOlO  HOOOIOOO  S:  0.3385050CE  05 
0010001000010100 
N:  144  R!  0 

**PF(22):  0001001000 

COST:  0.28255469E  03  CPU:  123.930 

0001001000  0001001100  1110110011  110000001  S:  0.36562500E  05 
0010001000010100 
N:  144  P:  0 

**PF(23 ):  0001001010 

COST:  0 .3389890 IE  03  CPU:  129  .528 

0001001010  COO  1001010  1110110000  1  10000000  S:  0 ,2 1 420500E  05 
0010001000010100 
N:  96  R:  0 

**PF(24):  0001001100 

COST:  0.33 133936E  03  CPU:  135.240 

0001001100  0001001100  1110110010  110000000  S:  0.379 18500E  05 
0010001000010100 
N:  96  P:  0 

**PF(25) :  0001010000 

COST:  0.28650B54E  03  CPU:  143.812 

0001010000  0001010100  1110101011  110000001  S:  0.36562500E  C5 

0010001000010100 

N:  144  P:  0 

**PF(26) :  0001010010 
rnsT*  n  ppii*  tqr 

oooi o i oo i o  ooo loiooio  moioiobb  110000000  s:  0.21420500E  05 

001000100C010100 
M:  96  F:  0 

**PF(27):  0001010100 

COST:  0.34019287E  03  CPU:  155.100 

000 1 0  1 C 1 00  0001010100  1110101010  110000000  S:  0.37? 18500E  05 

001000100C010100 

N:  96  P:  0 
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★*PF(28>:  0010000000 


COST:  0.26295435E  03  CPU:  160.957 

OOIOOOOOOO  0010010000  1001 1C1 1 1 1  000100111  S:  0.33398500E  05 
0010001000010100 

OOIOOOOOOO  0010000100  1001111011  000101001  S:  0.33398500E  05 

0010001000010100 

N:  96  R:  0 

**PF(29):  0010000010 

COST:  0.3 19389 16E  03  CPU:  165.814 

0010000010  0010000010  1001101  100  000100100  S:  0. 1825 J500E  05 
0010001000010100 

0010000010  0010000010  1001111000  000101000  S:  0.18256500E  05 
0010001000010100 
N:  80  R:-  0 

**PF(30):  0010000100 

COST:  0.31323926E  03  CPU:  170.606 

0010000100  0011000100  1100111010  100001000  S:  0.33398500E  05 
001 0001 0000 10  IOC 
N:  80  P:  0 

**PF(3 1 ):  0010001000 

COST:  0 .281 45459E  03  CPU:  176.327 

0010001000  0010001100  1100110011  100000001  S:  0.34754500E  05 
0010001000010100 
N:  96  P:  0 

**PF(32):  0010001010 

COST:  0.33788892E  03  CPU:  180.070 

0010001010  0010001010  1100110000  100000000  S:  0.19612500E  05 

0010001000010100 

N:  64  R:  0 

**PF(33):  0010001100 

COST:  0.33023926E  03  CPU:  183  .S 49 

0010001100  0010001100  1100110010  100000000  S:  0,361 10500E  05 
0010001000010100 
N:  64  R:  0 

**PF(34):  0010010000 

COST:  0.28720850E  03  CPU:  189  .739 

0010010000  0011010100  1100101011  100000001  S:  0.361 10500E  05 
0010001000010100 
N:  96  P:  0 
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**PF(35):  0010010010 


COST:  0.34364307E  03  CPU:  193.514 

0010010010  0011010010  1100101000  100000000  S:  0.209 68 5C0E  05 

0010001000010100 

N:  64  P:  0 

**PF(36):  001001C100 

COST:  0.34089307E  03  CPU:  197.332 

0010010100  0011010100  1100101010  100000000  S:  0.37466500E  05 

0010001000010100 

N:  64  R:  0 

**PF(37):  0010100000 

COST:  0.27765430E  03  CPU:  203.134 

0010100000  00101  10000  1100001  1  11  1000001  11  S:  0 .3 4754500E  05 
0010001000010100 

00101000000010100100  1100011011  100001001  S:  0.34754500E  05 

0010001000010100 

N:  96  R:  0 

**PF(38>:  0010100010 

COST:  0.3340891  IE  03  CPU:  207.387 

0010100010  0010100010  1100001100  100000100  S:  0.19612500E  05 
0010001000010100 

0010100010  0010100010  1100011000  100001000  S:  0. 1961 250CE  C5 

0010001000010100 

N:  72  P:  0 

**PF(39):  0010100100 

COST:  0.33103931E  03  CPU:  211.680 

0010100100  001010010C  1100011010  100001000  S:  C .361 1 0500E  05 

0010001000010100 

N:  72  P;  0 

**PF(40):  0010101000 

COST:  0.30105444E  03  CPU:  215.992 

00101010000010101100  1100010011  100000001  S:  0.38822500E  05 
0010001000010100 
N:  72  P:  0 

**PF(41 ):  0010101010 

COST:  0 .3574890 1  E  03  CPU:  218.807 

0010101010  0010101010  1100010000  100000000  S:  0.23680500E  05 
0010001000010100 
N:  48  R:  0 
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**PF(42):  0010101100 


COSTS  0*3498391 1 E  03  CPU:  221.700 

0010101100  0010101100  1100010010  100000000  S:  0.401 78500E  05 
0010001000010100 
N:  48  R:  0 

**PF(43):  0010110000 

COST*  0.30500830E  03  CPU:  226.028 

0010110000  0010110100  1100001011  100000001  S:  0.38822500E  05 
0010001000010100 
N:  72  R:  0 

**PF(44):  0010110010 

COST:  0.361443 12E  03  CPU:  228  .9°3 

0010110010  0010110010  1100001000  100000000  S;  0.23680500E  05 
0010001000010100 
N:  48  R:  C 

**PF(45):  0010110100 

COST:  0.35869287E  03  CPU:  231.57 

0010110100  0010110100  1100001010  100000000  S:  0.401 78500E  05 

0010001000010100 

N:  48  R:  0 

**PF(46):  0011000000 

COST:  0 .271 65430E  03  CPU:  239.698 

001 1000000  001 1010000  1100101111  100000111  S:  0.33850500E  05 
0010001000010100 

0011000000  0011000100  1 100 111011  100001001  S:  0.33850500E  05 

0010001000010100 

N:  128  R:  0 

**PF(47);  0011000010 

COST:  0.3280891  IE  03  CPU:  245.550 

0C1  1000010  0011000010  1  100101100  1C0000100  S:  0.1870850CE  05 
0010001000010100 

0011000010  0011000010  1100111000  100001000  S:  0.18708500E  05 

noioooiooooioioo 

N:  96  P;  0 

**PF(48):  0011000100 

rnPT*  0  pp  I '  •  <'51  *1/ 

oonoooioo  0011000100  noomoio  100001000  S:  0.35206500E  05 
0010001000010100 
N:  96  R:  0 
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**PF(49>:  001  1001000 
COST:  0.29535449E  03  CPU:  257.043 

0011001000  0011001100  1100110011  100000001  S:  0.3791850CE  05 

0010001000010100 

N:  96  R:  0 

**PF(50):  0011001010 

COST:  0.351 78906E  03  CPU:  260.826 

001 1001010  001 1001010  1100110000  100000000  S:  0.2277650CE  05 

0010001000010100 

N:  64  R:  0 

**PF(51):  0011001100 

COST:  0 .344 1  39 1 6E  03  CPU:  264.714 

0011001100  0011001100  1100110010  100000000  S:  0.39274500E  05 

0010001000010,00 

N:  64  R:  0 

**PF<52>:  C011010C00 

COST:  0.29930835E  03  CPU:  270.519 

0011010000  0011010100  1100101011  100000001  S:  0.37918500E  05 
0010001000010100 
Nt  96  R:  0 

**PF(53):  0011010010 

COST:  0 .355743  1 6E  03  CPU:  274.326 

0011010010  0011010010  1100101000  100000000  S:  0.2277S5C0E  05 

001000100001010C 

N:  64  R:  0 

**PF(54):  001 1 C 1 0 100 

COST:  0.35299292E  03  CPU'  278.133 

0011010100  0011010100  1 100 10 10 10  100000000  S:  0.39274500E  05 
0010001000010100 
H:  64  P:  0 

**PF(55):  0100000000 

COST:  0.25885449E  03  CPU:  287.964 

0100000000  0100010000  1011101111  001100111  S:  0.324S4500F  05 
0010001000010100 

01000000000100000100  1011111011  001101001  S:  0.32494500E  C5 

OOlOOOlOOOOlOiOO 

N:  160  R:  0 
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**PF(56;5  OICGOOOOIO 


COST:  0 .3  ! 528906E  03  CPU:  295.706 

01000C0010  0100000010  10 1 1 101  ICO  001100100  S:  0,1 7352500E  05 
0010001000010100 

0100000010  0100000010  1011111000  001101000  Ol  C, 1 7352500E  05 

0010001000010100 

N*  128  Pt  0 

**PF(57):  01000C0100 

COST:  0.3  108391  IE  03  CPU:  303,393 

010C000100  C 101000100  1010111010  000001000  S:  0.333S8500E  05 
0010001000010100 
N:  128  P:  0 

**PF(58):  0100001000 

.  OST:  0 .2790544  4E  03  CPU:  311.970 

0100001000  0100001100  1010110011  000000001  S:  0.34754500E  Of 
0010001000010100 
N:  144  P :  0 

**PF<59):  01C00C1010 

COST:  0 .3354890  IE  03  CPU:  317.647 

0100001010  0100001010  10101 10000  000000000  S:  0.1961*,:C0E  05 

0010001000010100 

fi:  96  R:  0 

**PF(60):  0100001100 

COST:  0.3278391  IE  03  CPU:  323.387 

0100001100  0100001100  1010110010  000000000  S:  0.351 10500E  05 

0010001000010100 

N:  96  P:  0 

**PF(61)j  0100010000 

COST:  0.28480830E  03  CPU:  332, CIS 

0100010000  0101010100  1010101011  000000001  S:  0,361 10 50 OF.  05 
001000100001010^ 

N:  144  P:  0 

**PF(62):  0100010010 

rneT.  n  ns  rPH*  ^7  7n* 

DincOlOOlO  0101010010  lOioioiOOO  000000000  S:  0.20968500E  05 
0010001000010100 
N:  96  P:  0 
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**PF(63):  0100010100 


COST:  0.33849316E  03  CPU:  343.408 

0100010100  0101010100  1010101010  000000000  S:  0.374665CCE  05 

0010C010C0010100 

N:  96  R:  0 

**PF( 64 ) :  0100100000 

TORT*  0  P75P5433F  03  CPU*  35!  IPO 

0100*100000  01001  10000  1010001  1  11  00000011  !  S:  0.3475450CF  05 
0010001000010100 

0100100000  0100100100  1010011011  000001001  S:  0.34754500E  05 

0010001000010100 

N:  128  R:  0 

**PF(65>:  0100100010 

COST:  0 .33 1 68921 E  03  CPU:  356.826 

0100100010  0100100010  1010001100  000000100  S:  0*1961 25C0E  05 
0010001000010100 

0100100010  0100100010  1010011000  000001000  S:  0.1961 2500E  05 

0010001000010100 

N:  96  P:  0 

**PF(66):  0100100100 

COST:  0.32863916E  03  CPU:  362.515 

0100100100  0 1 00100  10O  1010011010  000001000  S:  0,361 1 0500E  05 
00 1000 1 000010 100 
N:  96  P:  0 

**PF(67):  Cl OOi 01000 

COST:  0.29805454E  03  CPU:  368.200 

0100101000  0100101100  1010010011  000000001  St  0 .38822.500E  05 

0010001000010100 

N:  96  R?  0 

;<*PF(  68) :  0100101010 

COST:  0.3550891  IE  03  CPU:  371.555 

0100101010  0100101010  1010010000  OOOOOOCOO  S:  0.23680500E  05 
0010001000010100 
N :  64  P :  0 

**PF(6S):  0100101100 

COST:  0.34743896E  03  CPU:  375.765 

0100101100  0100101100  1010010010  000000000  S:  0.40178500E  05 
,0010001000010100 
N:  64  P;  0 
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**PF(70):  0100110000 


COST!  0 .30260840E  03  CPU:  381.511 

0100110000  0100110100  1010001011  000000001  S:  0.38822500E  05 

0010001000010100 

N:  96  R:  0 

**PF(71 )s  0100110010 

rn^T*  n  *  5904^9  if  ox  ppm.  pqh 

0100*1  10010*01001  10010  1010601006  OOOOOOOOO  S:  0.23680500E  05 
0010001000010100 
N:  64  R:  0 


**PF(  72 ) :  0100110100 

COST:  0.35629297E  03  CPU!  389.080 

01001 lu 1 00  0100110100  1010001010  000000000  S:  0.401 78500E  C5 
0010001000010100 
N:  64  R:  0 

**PF(73):  0101000000 


COST!  0.271  15430E  03  CPU:  396. 7f,5 

0101000000  0101010000  1010101111  000000111  S:  0.33850500E  05 
0010001000010100 

0101000000  0 1 01000 1 n9  1010111011  000001001  S!  0.33850500E  05 

0010001000010100 

N:  128  R:  0 

*kPF(74):  0101000010 


COST:  0.3275891  IE  03  CPU:  402.623 

0101000010  0101000010  1010101100  000000100  S:  0, 1 8708500E  05 
0010001000010100 

0101000010  0101000010  lOlOll'JOO  000001000  S:  0, 1 8708500E  05 

0010001000010100 

N:  96  P:  0 


**PF( 75) :  0101000100 

COST:  0.324839!  IE  03  CPU:  408.3  15 

u 1 C 1 000 1 00  0101000100  1010111010  000001000  Ss  0.35206500E  05 
00 1 000 i 0000  10100 
N:  96  P:  0 

■^*PF(  76) :  0 1 0 1  CO  1000 


COST:  0.294*»5449E  03  GPU:  413.986 

0101001000  OlOlOOllCO  iOlOllOOll  OOOOOOOOl  S:  0.37S18500F  05 

0010001000010100 
N:  96  R;  0 
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**PF(77):  010 10010 1C 

COST:  0.35128906E  03  CPU:  417.735 

0101001010  0101001010  1010110000  000000000  S:  0.22776500E  05 
0010001000010100 
N:  64  P:  0 

**PF(78):  0101001100 

COST:  0.34363892E  03  CPU:  421.520 

0101001100  0101001  100  10 1C  1 1 00 10  000000000  S:  0.39274500E  05 
0010001000010100 
N:  64  R:  0 

**PF(79):  0101010000 

COST:  0.2988081  IE  03  CPU:  427.219 

0101010000  0 10 1 C 1 0 1 00  1010101011  000000001  S:  0.37918500E  C5 
0010001000010100 
N:  96  R:  0 

**PF<80):  0101010010 

COST:  0.35524292E  03  CPU:  430.977 

0101010010  0101010010  1010101000  000000000  S:  0.22776500E  05 
0010001000010100 
N:  64  P:  0 

**PF(81>:  0101010100 

COST:  0  .352493  1 6E  03  CPU:  434.735 

0101010100  0101010100  1010101010  000000000  S:  0.3S274500E  05 
0010001000010100 
N:  64  R:  0 

PF(55)  CONTAINS  BEST  SOLUTIONS  ).  7232  SOLUTIONS  CONSIDERED. 

MEAN  COST  OVER  ALL  SCM'TIONS;  0.35939697E  03 
MAXIMUM  COST  ENCOUNTERED:  0.5859P242E  03 

* *********  SOLUTION  TERMINATED  ********** 
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Appendix  H 


SOLUTION  LISTING  FOR  PRIMITIVE  OPERATION  Q? 
IN  THE  MEDICAL  DIAGNOSIS  PROBLEM 


******?****  SOLUTION  BEGINS  ********** 

**PF (  1):  0000000000 

COST:  0.3 1459985E  03  CPU:  2.052 

oooooooooo  oioooooooo  loimmi  ooncmi  S:  0.19160500E  05 
0000001000000000 

oooooooooo  0001000000  mciiiiii  110001111  S:  0.19160500E  05 
0000001000000000 
N:  64  R:  0 

**PF(  2) :  0000000010 

COST:  0.38659985E  03  CPU:  4.135 

0000000010  0 1 000000 1C  1011111101  CO K 101 100  S:  0 • 19520S00E  05 
0000001000000000 

0000000C10  0001000010  1  1101  1  1101  110001  100  S:  0 . 19 520500E  05 
0000001000000000 
N:  64  R:  0 

**PF(  3):  0000000100 

COST:  0.37699976E  03  CPU:  6.150 

0000000100  0100000100  1011111011  001101001  S:  0.20360500E  05 
0000001000000000 

0000000100  0001000100  1110111011  110001001  S:  0.20360500E  05 

0000001000000000 

N:  64  P:  0 

**PF<  4):  0000001000 

COST:  0.32059985E  03  CPU:  9.037 

0000001000  0100001000  1011110011  001100001  S:  0,1 5560500E  05 
0000001000000000 

0000001000  0001001000  1.1 1 0 1  1 00 1  1  110000001  S:  0.15560500E  05 
0000001000000000 
N*  96  R:  0 
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**PF(  5):  0000001010 


COST:  0.39019995E  03  CPU:  11*008 

0000001010  0100001010  1011110001  001100000  S:  0.15920500E  05 
COOOCOIOCOOOOOOO 

00000010100001001010  1110110001  110000000  S:  0.15920500E  05 
COOOOO 1000000000 
Nt  64  R:  0 

**PF(  6):  0000001 100 

COST:  0.40099976E  03  CPU:  13.001 

0000001100  0100001100  1011110011  00110C001  S:  0.239605C0E  05 
OOCOOOIOOOOOOOOO 

0000001  100  0001CC1 100  1110110011  110000001  S:  0.23960500E  05 

0000001000000000 

N:  64  R:  0 

**PF(  7):  0000010000 

COST:  0 .342 1 997 1 E  03  CPU:  15.927 

0000010000  0100010000  1011101011  001100001  S:  0.I556050CE  05 
000C00100000000C 

0000010000  00C101C000  1110101011  110000001  S*  0 . 1 556050CE  05 

ooc;  'r\ ooooooooo 
Nt  'a  P:  0 

**PF(  8):  0000010010 

n  4ii7QC)onF  ppii*  17  qpi 

0000*010010  0100010010  10  J 1 10 100 1'  001 100000  S:  0  .  i 59  2050CE  05 
0000001 OOOOOOOOO 

00000100100001010010  11 10101001  1100C0000  S:  0.15920500E  C5 

0000001000000000 

Nt  64  R:  0 

**PF(  9):  0000010100 

rnsT*  n  4^45qqssf  ppm* 

oooooioioo  oioooioioo  louicoon  001100001  s:  0.23960500E  05 
000000 1000000000 

00000101000100010100  1011101011  OOllOOOOl  S:  G.2396C50CF  05 
0000001000000000 

OOOOOIOIOO  0001010100  moioooil  IlCOOOOOl  S :  0.23960500F  05 
oooocoi ooooooooo 

0000010100  0001010100  1 1 1010101 i  1  i 000000 1  S:  0.2396050CE  05 

0000001000000000 

N:  64  R:  0 
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**PF<10):  0000100000 


COST:  C.30859985E  03  CPU:  23.736 

0000100000  0100100C00  10110011  11  001000111  S:  0.1 556050CE  05- 
0000001000000000 

0000100000  0100100000  1011011011  001001001  S:  0,1 5560500E  05 
COOOOO 1000000000 

0000100000  0001100000  1110001111  110000111  S:  0. 1 5560 500E  05 
0000001000000000 

0000100000  0001100000  111001 101 1  110001001  S:  0, 1 5560500E  C5 

0000001000000000 

N:  128  P:  0 

**PF(11):  0000100010 

COST:  0.3781S971E  03  CPU:  25.644 

0000100010  0100100010' 101 1001 101  001000100  S:  0, 1 5S20500E  05 
0000001000000000 

0000100010  01001C0010  1011011001  001001000  S:  0. 1 5920500E  05 
0000001000000000 

0000100010  0001100010  1110001101  110000100  S:  0 . 1 5S20500E  05 
0000001000000000 

0000100010  0001100010  1110011001  110001000  S:  0 . 1 5S205C0E  05 

0000001000000000 

N:  96  P:  0 

**PF( 12 ):  0000100100 

COST:  0.40099976E  03  CPU:  29.525 

0000100100  0100100100  10 1 10 1 00  1 1  001000001  S:  0.23960500E  05 
0000001000000000 

0000100100  0100100100  1011011011  001001001  S:  0.23960500E  05 
0000001000000000 

0000100100  0001100100  11 1 00 1 00 l 1  110000001  S:  0.23S6C500E  05 
0000001000000000 

0000100100  0001100100  1110011011  110001001  S:  0.23960500E  05 

0000001000000000 

N:  96  P:  0 

**PF(13):  0000101000 

COST:  0.34459985E  03  CPU:  32.450 

00001010000100101000  1011010011  001000001  S:  0.191 60500E  05 
0000001000000000 

0000101000  C0CU0100C  1 1  100 1 00 1 1  110000001  S:  0.191605C0E  05 

0000001000000000 

N:  96  P:  0 
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**PF( 14):  0000101010 


COST:  0.414 1999 5E  03  CPU:  34.404 

0000101010  0100101010  1011010001  001000000  S:  0, 1952C5CCE  05 
0000001000000000 

0000101010  000 1  1C  10 10  1110010001  1  10000000  S:  0. 19  520500E  05 
0000001000000000 
N:  64  R:  0 

**PF< 1 5) :  0000101100 

COST:  0.4249S976E  03  CPU:  36.357 

0000101100  0100101  100  1011010011  001000001  S:  0 ,275(.050CE  05 
000000 lOOOOOObOO 

0000101100  OOC 1  101 100  1  1  10010011  1  10000001  S:  0. 27560 5C0E  05 
0000001000000000 
N:  64  R:  0 

**PF( 16):  0000110000 

COST:  0 .366 1997 1 E  03  CPU:  39.190 

0000110000  0100110000  1011001011  001000001  S:  0.191 60500F  05 
0000001000000000 

0000110000  000 1 1 1 CCOO  1110001011  110000001  S:  0,191 6050CE  05 

0000001000000000 

N:  96  P:  o 

**PF( 1 7) :  0000110010 

COST:  0.43579980E  03  CPU:  41. 1*0 

0000110010  0100110010  1011001001  Cb:0C0000  S:  C, 19 52C5GCF  05 
0000001000000000 

0000110010  0001110010  1110001001  110000000  S:  0,19 520500E  05 
0000001000000000 
Nt  64  P:  0 

**PF( IS) :  0000110100 

COST:  0.45859985E  03  CPU:  43  .082 

0000110100  0100110100  1011000011  001000001  S:  0.27560500E  05 
0000001000000000 

0000110100  0100110100  iOl 100101 1  001000001  S:  0.27560500E  05 
0000001000000000 

00001  10100  0001 : 10100  lllOOOOOll  llOOOOOOl  S:  0.2756C500E  05 

00  '^nniooooooooo 

oo.,.  • oooi  i  ioioo  moooion  110000001  s:  0.27560500E  05 

000001 1000000000 
f,j  64  P:  0 


**PF( 19) 


0001000000 


COST:  0.3 1259985E  03 
C001000000  0001000000 
0000001000000000 
0001000000  0001C00000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
0001000000  0001000000 
0000001000000000 
oooioooooo  oooiooeooo 
0000001000000000 
OOOIOOOOOO  OOOIOOOOOO 
0000001000000000 
OOOIOOOOOO  OOOIOOOOOO 
0000001000000000 
OOOIOOOOOO  OOOIOOOOOO 
0000001000000000 
Ns  192  R:  0 


CP’J:  48.855 
0110001111  010000111 

0110011011  010001001 

0110101111  010000111 

0110111011  OiOOOlOOl 

1100001111  100000111 

1 1000 11011  100001001 

1100101111  1000001 1 1 

1100111011  10000100! 

1110001111  110000111 

1110011011  110001001 

1110101111  110000111 

1110111011  110001001 


S:  0. 1336C500E  05 
S:  0 . 1 3360500E  05 
Ss  0, 133605Q0E  05 
S:  0, 13360500E  05 
S:  0, 13360500E  05 
S:  0, 13360500E  05 
S:  0. 133605C0E  05 
S:  0. 1 336050CE  05 
S:  0. 1 33605QCE  05 
S:  0, 1 3360500E  05 
S:  0.13360500E  05 
S:  0.13360500E  05 
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**PF<20):  0001C00010 

COST:  0 .382  1?97 1 E  05  CPU:  53.3S7 


0001000010  000100001C 
0000001000000000 

0110001101 

010000100 

S: 

0. 1 372050CE 

05 

COOIOCmOIO  0001000010 
OOQOOC 1000000000 

0110011001 

01000100C 

S: 

0.13720500E 

05 

0001000010  00010C0010 
0000001000000000 

0110101101 

01000010C 

Si 

0. 1 372050CE 

05 

0001000010  0001000010 
0000001000000000 

0  1 101 1 1001 

01000100C 

Si 

0«  1 3  72050CE 

05 

0001000010  0001000010 
0000001000000000 

1100001 101 

100000100 

S : 

0. 137205C0E 

05 

0001000010  0001000010 
0000001000000000 

1  100011001 

100001000 

S: 

0. 137205C0E 

05 

0001000010  0001000010 
0000001000000000 

1  100101101 

100000100 

S: 

0.1 I720500E 

05 

0001000010  0001000010 
0000001000000000 

1  10011 1001 

100C01000 

c  . 

0.13''205nrr 

05 

0001000010  0001000010 
0000001000000000 

1 1  iooo i i oi 

1  10C00100 

S: 

0.1372C500E 

05 

0001000010  0001000010 
0000001000000000 

1110011001 

110001000 

? : 

0.13  720500E 

05 

0001000010  0001000010 
0000001000000000 

1 1  loioiioi 

1  100C0100 

S: 

0 . 1  3720500E 

05 

0001000010  0001000010 
0000001000000000 

N:  144  R:  0 

111011 1001 

1 1 0001 oco 

S: 

0,13  720500F 

C5 
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**PF(21) 


000 i 000 100 


COST:  0.40499976E  03 
0001000100  0001000100 
0000001000000000 
C001000100  000100010C 
0000001000000000 
0001000100  0001000100 
0000001000000000 
0001000100  0001000100 
0000001000000000 
0001000100  0001000100 
0000001000000000 
0001000100  0001000100 
0000C01000000000 
0001000100  0001000100 
COOOOO  1000000000 
0001000100  0001000100 
ooooooiocooooooo 
0001000100  0001000100 
0000001000000000 
0001000100  0001000100 
OOCOOCl 000000000 
0001000100  0001000100 
OCOOOOIOOOOOOOOO 
0001000100  0001000100 
0000001000000000 
N:  1 44  R:  0 


CPU:  57. 
0  1 100 1C01 1 

851 

01 0000001 

0110011011 

010001001 

011011001 1 

010000001 

0110111011 

010001001 

1 10001001 1 

100000001 

1100011011 

100001001 

1 10011001 1 

100000001 

1100111011 

1 00001 OC 1 

1 11001001  1 

110000001 

1 1 1001 1011 

1 10001001 

1 1 1011001 1 

i  mcooooi 

1 1 1011 101  i 

1  10001001 

S: 

0.21 760500E 

05 

S: 

0. 21760 500E 

05 

S: 

0. 21760 500E 

05 

S: 

0.2 1 760500E 

05 

S: 

0.21760500E 

05 

S: 

0. 21760 500E 

05 

S: 

0.21760500E 

05 

S: 

0 .2 1 760500E 

05 

S: 

0.21 76C500E 

05 

S: 

0.21760500E 

05 

S: 

0.21 760500E 

05 

S: 

0. 21760 500E 

05 

**PF(22):  0001001000 


COST:  0.3485S9S5E  03 
0001001000  0001001000 
0000001000000000 
0C01001000  0001001000 
0000001000000000 
0001001000  0001001000 
0000001000000000 
0001001000  0001001000 
0000001000000000 
0001001000  0001001000 
0000001000000000 
0001001000  0001001000 
OCOOOOIOOOOOOOOO 
N:  144  P:  c 


CPU:  62.324 
0110010011  ^iOOOOOOl 

01 101 1001  1  010000001 

1100010011  100000001 

1100110011  100000001 

1110010011  110000001 

1 1 1 0 1 1  CO  1  1  110000001 


S:  0 . 1  6S 6050CE  05 
S:  0, 1 69 60500E  05 
S:  0, 1  69 60500E  05 
S:  0 . 1  6960500E  05 
S:  0 . 1 6960500E  05 
S:  0, 1 69605C0E  05 
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“<!***!  WUftW* 


**PF(23>:  OOOlOOiOlC 


COST:  0.41819995E  03 

CPU:  65. 

301 

0001001010  0001001010 

onooicooi 

010000000 

S: 

0. 1  73205CCE 

C5 

0000001000000000 
0001001010  0001001010 

oi  lonoooi 

010000000 

S: 

0. 1 73205CCF 

05 

0000001000000000 
0001001010  00CJ001010 

1100010001 

100000000 

S: 

0. 1  732050CE 

05 

000000 1000000000 
0001001010  0001001010 

1100110001 

100C0000C 

S: 

C.1732050CE 

05 

0000001000000000 
0001001010  0001001010 
0000001000000000 

1  110010001 

110000000 

S: 

0. 1732050CE 

05 

OOCIOOIOIO  0001001010 
0000001000000000 

N:  96  R:  0 

1110110001 

110000000 

S: 

0.173205C0E 

C5 

**PF(24):  0001001100 

COST:  0.42899976E  03 

CPU:  68. 

261 

0001001100  0001001100 

0110010011 

010000001 

s: 

0.253 60500E 

05 

0000001000000000 
0001001 100  00010G1 100 

011011001 1 

010000001 

S: 

0.25360500E 

05 

0000001000000000 
oooioonoo  oooioonoo 

1  100010011 

100C00001 

Si 

0.253  6050CE 

05 

0000001 000000000 
0001001 100  0001001 100 

1100110011 

100000001 

Si 

0.2536C50CE 

C5 

0000001000000000 
oooioonoo  oooioonoo 

1110010011 

110000001 

S: 

0 .253  6C50CF 

05 

0000001000000000 
oooiooi  ioo  oooioonoo 

OOOOOOIOOOOOOCCO 

1  110110011 

11000C001 

Si 

0.2536050CE 

05 

N:  96  R:  0 


**PF<25):  0001010000 


COST:  0.37019971E  03  CPU:  72. 

560 

oooioioooo  oooi:jioooo  onoooion 

010000001 

S: 

0. 1  6960500E 

C5 

OOOOOOlOOOOOCuOO 

OOOIOIOOOO  000101C000  011010101! 

010000001 

S: 

0.165  6C50CE 

05 

0000001000000000 

OOOIOIOOOO  OOOIOIOOOO  1100001011 

100000001 

S: 

0,  1 69  605C0E 

05 

0000001000000000 

OOOIOIOOOO  OOOIOIOOOO  110010101! 

100000001 

S: 

0, 1  69  60500E 

05 

0000001000000000 

OOOIOIOOOO  OOOIOIOOOO  1110001011 

11 0000001 

Si 

0. 169  60500E 

05 

0000001000000000 

oooioioooo  oooioioooo  moioion 

ooooooiocooooooo 

110000001 

Si 

0. 169 605CCE 

05 

N:  144  Fj  0 
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I. 


**PF(26) 


OOCICIOOIC 


COST:  0.43979980E  03 
0001010010  0001010010 
000000 lCCOOCOOOO 
0001010010  0001010010 
000000 1000000C00 
0001010010  0001010010 
0000001000000000 
0001010010  0001010010 
0000001000000000 
C0C101CC10  C0C1010010 
0000C01 000000000 
000101C010  0001010010 

i  cocoooiooococooo 

N:  96  F:  0 

**PF (27) s  0001010100 

COST:  0.46259985E  03 
0001010100  0001010100 
000000 100000000C 
0001010100  0001010100 
0000001000000000 
0001010100  0001010100 
000000 1000000000 
0001010100  0001010100 
OOOOOOIOOOOOOOOO 
0001010100  0001C1010C 
OOOOOOIOOOOOOOOO 
0001010100  0001010100 
OOOOOOIOOOOOOOOO 
0001C10100  0001010100 
OOOOOOIOOOOOOOOO 
0001010100  0001010100 
I  OOOCCOIOOCOOOOOO 

0001010100  OOOlOiOlOO 
OOOOOOIOOOOOOOOO 
0001010100  0001010100 
OOOOOOIOOOOOOOOO 
0001010100  000 1010100 
OOOOOOIOOOOOOOOO 
0001010100  0001010100 
OOOOOOIOOOOOOOOO 
M:  96  R:  0 

**PF(2g):  0010000000 
COST:  0.30859985E  03 

0010000000  oucoooooo 

OOOOOOIOOOOOOOOO 
0010000000  01  10000000 
OOOOOOIOOOOOOOOO 
N:  96  P:  C 


CPU:  75.517 


0110001001 

010000000 

S: 

0.1  73205CCE 

05 

0 1 1010 100 1 

010000000 

S: 

0. 1 7320500E 

05 

1100001001 

100000000 

S: 

0,1 7320500E 

05 

1 100101001 

100000000 

S: 

0, 1 7320500E 

05 

1110001001 

110000000 

S: 

0.1732C500E 

05 

1110101001 

1 10000000 

S: 

0, 1 7320500E 

05 

CPU:  78. 
0110000011 

462 

010000001 

S: 

0.25360500E 

05 

01 10001011 

010000001 

S: 

0.253 60 500E 

05 

0110100011 

010000001 

S: 

0, 25360 500E 

05 

01 10101011 

010000001 

S: 

0.253 60 500E 

05 

1 100C0001  1 

100000001 

S: 

0.25360500E 

05 

1100001011 

10000C001 

S: 

0.25360500E 

05 

1 100100011 

100000001 

S: 

0.25360500E 

05 

1100101011 

100000001 

S: 

0.253605Ar,E 

05 

1 1 100H001 1 

110000001 

S: 

0.25360500E 

05 

1110001011 

1 10000001 

S: 

0.253 60500E 

05 

1 110100011 

110000001 

S: 

0.25360500E 

05 

i 1 10101011 

1 10000001 

S: 

0.25360500E 

05 

CPU:  81. 
1001 1011 1 l 

600 

000100! 1 1 

s: 

0 . 1 55605CCE 

05 

1001111011 

000101001 

S: 

0.1556J500E 

05 
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**PF<29) :  00100C0010 
COST:  0.37F19971E  03  CPU:  84. 04° 

0010000010  0110000010  1001101101  00010C100  S:  0, 1 592050CE  C5 
0000001000000000 

001C000010  011000001C  1001111001  000101000  St  0.15920500E  05 

0000C0100000C000 

N:  80  R:  0 


**PF(30):  001000010C 

COST:  0.38299976E  03  CPU:  86.394 

0010000100  0011000100  0100111011  000001001  S:  0.209605C0E  C5 
0000001000000000 

0010000100  0011000100  1100111011  100001001  S:  0.2096C500E  05 
000000 1C00000000 
N:  80  R:  0 


**PF(3 1 ) :  0010001000 

COST:  0.32659985E  03  CPU:  89.261 

0010001000  0011001000  0100110011  000000001  S:  0.161 6C500E  05 
0000001000000000 

0010001000  001  1001000  1100110011  100000001  S::  0.161 60500E  05 

0000C010C0000000 

N:  96  R:  0 


**PF(32):  0010001010 

P  fK  T  •  O  0*  p  P 1 1  •  Q  ]  PI! 

0010001010  0011001010  oiooiioool  OOOOOOOOC  S:  0 • 1 6520500E  05 
0000001000000000 

001C001010  OC 1 1001010  1100110001  100000000  S:  0. 1 652050CE  C5 

0000001000000000 

N:  64  R:  0 


**PF (33) :  0010001100 

COST:  0.40699976E  03  CPU:  93.119 

0010001100  0011001100  0 100 ; 1 00 1 1  OOOOOOOOl  S:  0.24560500E  05 
0000001000000000 

0010001100  0011001100  1100110011  100000001  S:  0.24560500E  05 
0000001 OOOOOOOOC 
N:  64  R:  0 

**PF(34):  0010010000 


COST:  0.3481S971E  03  CPU:  95.969 
0010010000  0011010000  0100101011  OOOOOOOOl 
0000001000000000 

00100100000011010000  1100101011  100000001 
OOCO 00 1000000000 
N !  96  R:  0 


S:  0.16160500E  05 
S:  0.161 60500E  05 
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**PF(35):  0010010010 


COST:  0.41779980E  03 
0010010010  OC 1 1010010 
0000001000000000 
0010010010  0C1 1010010 

0000001 ocoooocoo 

N:  64  P:  0 

**PF(36):  00100101C0 

COST:  0.44059985E  03 
0010010100  0C1 1010100 
0000001000000000 
00 1 00 1 0 1 OQ  0011010100 
0000001000000000 
0010010100  001 1010100 
000000’ 000000000 
0010010100  0011010100 
0000001000000000 
N:  64  R:  0 

**PF(37>:  001010U000 


**PF(38):  001010UCIC 
COST:  0 .384  ’ 997 IE  03 

ooioioooio  oo*  ; 100010 
onooooi ooooooooo 
0010100010  0011  1000  10 
0000001000000000 
OOIOIOOOIO  001 1 10CC10 
0000001000000000 
OOIOIOOOIO  001 1 100010 
0000001000000000 
N:  72  P  r  0 


CPU:  97.915 
0100101001  OOOOOOOOO 


rPII.  QQ  0*0 
01 00 1000 1 1  000000001 

0100101011  000000001 

1100100011  100000001 

1100101011  100000001 


CPI:  104.879 
C10C001101  000000100 

0100011001  00000100C 

1100001101  100000100 

1100011001  100001000 


S:  0,1 6520500E  05 
S:  0, 1 6520500E  05 


S:  0. 24560 500E  05 
S:  0.24560500E  05 
S:  0.24560500E  05 
S:  0.24560500E  05 


S:  0. 1 6160500E  05 
S:  0. 1 61 605C0E  05 
S:  C.16I60500E  05 
S:  0.1 6160500E  05 


S:  0 . 1 6520500E  05 
S:  0. 1 6520500E  05 
S:  0.1 6520500E  05 
S:  0. 1 6520500E  05 


1100101001  100000000 


COST:  0.3 1 459985E  03  CPU:  102.694 
C010100000  0011100000  0100001111  000000111 
0000001000000000 

0010100000  0011100000  01000 1*011  000001001 
0000001000000000 

0010100000  0011100000  1 100  301 1 1 1  1000001  11 
0000001000000000 

00101000000011100000  1100011011  100001001 
0000001000000000 
N:  96  P:  0 
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**PF(39):  CO  10 100 100 

COST:  0.40699976E  03  CPU:  107,052 

0010100100  0011  100100  0100010011  000000001  S:  0.2456C5C0E  05 
0000001000000000 

001010C100  0011100100  0100011011  000001001  S:  0.24560500E  05 
0000001000000000 

0010100100  0011100100  1100010011  100000001  S:  0.245605C0E  05 
0000001000000000 

0010100100  00 1 1 ! 00 1 00  1100011011  100001001  S:  0.24560500E  05 
0000001000000000 
N:  72  P:  0 

=*PF( 40) :  0010101000 

COST:  0.35059985E  03  CPU:  109  .207 

OO'OIOIOOO  0011101000  0100010011  000000001  S:  0.19760500E  05 
000000' 000000000 

00101010C0  0011  101000  <100010011  100C00001  S:  0 . 19  760500E  05 
0000001000000000 
N:  70.  R  s  0 

**PF(41 ):  0010101010 

COST:  0.420 1 9995E  03  CPU:  110.650 

0010101010  0011101010  0 10001000 1  000000000  S:  C . 20 1 20500E  05 
000000  1000000000 

0010101010  0011101010  1 1 000 1 000 1  100000000  S:  0.201 20500E  05 
0000001000000000 
N:  4?  F :  0 

**PF(42):  0010101 100 

COoi':  0.43099976E  05  CPU:  112.115 

001010)100  001 i 101 luO  0100010011  OOOOOOOt  ":  0.281 6C5C0E  05 
0000001000000000 

0010101  IOC  0011101100  1  10001001  1  100000001  S:  0.281 60500E  05 
000000  lOOOOOOCCO 
N:  4g  R:  0 

**PF(43):  0010110000 

COST:  0,37219971  E  03  CPU:  114.256 

0010110000  00171 100^0  0 10000 1011  000000001  S:  0 , 1 9 760500E  05 
0000001000000000 

0010110000  0011110000  1 10000  1011  10000000!  S:  C , 19 760500E  05 
OOOOOOIOOOOOOCOO 
72  P:  0 
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**PF(44):  0010110010 


COST:  0.441 79980E  03  CPU:  115.698 

0010110010  0011110010  0100001001  000000000  S:  0.2012C500E  05 
0000001000000000 

0010110010  0011110010  110000100!  100000000  S:  0.20120500E  05 
0000001000000000 
N:  48  R:  0 

**PF(45) :  0010110100 

COST:  0.46459985E  03  CPU:  117.153 

0010110100  0011110100  0100000011  000000001  S:  0.28160500E  05 
OOOOOC 1000000000 

0010110100  001 1 1 10100  010000101  1  000000001  S:  0. 28160 500E  05 
0000001000000000 

OOlOliOlOO  0011110100  '.100000011  10C00000I  S:  0.28160500E  05 
0000001000000000 

0010110100  0011110100  1100001011  100000001  S:  0.28160500E  05 
0000001000000000 
,M:  4p  P:  o 

**PF(46>:  00 1 1 000000 

COST:  0.3 1659985E  03  CPU:  121.024 

0011000000  0011000000  0100001111  000000111  S:  0, 13960500E  05 
0000001000000000 

0011000000  0011000000  0100011011  000001001  S:  0, 13960500E  05 
OOOOCO 1000000000 

0011000000  001  1000000  01 00 10  1 1 1 i  000000111  S:  0 . 1 3960500E  05 
0000001 000000000 

001100C000  0011000000  0100111011  000001001  S:  0. 13960500E  05 
0000001000000000 

0011000000  0011000000  1100001111  100000111  S:  0.13960500E  05 
0000001000000000 

001 1000000  0C1 1000000  1100011011  100001001  S:  0. 13S60500E  05 
0000001000000000 

001 1000000  001 1000000  1100101111  100000111  S:  0.13960500E  05 
0000001000000000 

0011000000  0011000000  1 1 00 1 1 1 0 1  I  100001001  S:  0.13960500E  05 

0000001000000000 

N:  128  R:  0 


**PF<47):  CO  1 1 OOOO  10 


COST?  0 .38619971 E  03 

CPI:  124. 

056 

001 1000010  001 1C00010 
0000001000000000 

0100001 101 

000000  IOC 

s: 

0, 1 4320500E 

05 

0011000010  00 1 1000010 
COOOOOIOOOOOOCOO 

010001 1001 

000001000 

Si 

0. 1  432050CE 

05 

001 1000010  0011000010 
0000001000000000 

ciooio; ioi 

000000  IOC 

Si 

0.1 4320500E 

05 

0011J00010  0011000010 
0000001000000000 

C1001 1 1001 

OOO'OIOOO 

S: 

0. 1 4320500E 

05 

001  1000010  0011000010 
0000001000000000 

1  100001101 

100000100 

S: 

C, 1 432050CE 

05 

0011000010  Oul 1000010 
OOOOOOIOOCOOOOOO 

110001 1001 

100001000 

S: 

0.1  4320  50  OE 

05 

001 1000010  0011000010 
0000001000000000 

1  100101101 

100000100 

S: 

0. !  43205C0E 

05 

0011000010  001 1000010 
0000001000000000 

110011 1001 

iOOOOlCOO 

S: 

0.M3  20  5C0E 

05 

N:  96  R:  0 


**PF<48):  OCl 1000100 


COST:  0.40899976E  03 

CPI1:  127. 

033 

0011000100  0011000100 
0000001000000000 

0100010011 

000000001 

S: 

0.253S0500E 

05 

0011000100  0011000100 
0000001000000000 

0100011011 

000001001 

Si 

0.22360500E 

05 

0011000100  001 1000100 
0000001000000000 

o 

o 

o 

o 

o 

OOOCOuOOl 

S: 

0.22360500E 

05 

0011000100  0011000100 
0000001000000000 

o 

o 

o 

o 

000001001 

S: 

0.2236C500E 

C5 

0011000100  00! 1000100 
■000001000000000 

1  1000! 00 1 1 

1000C0001 

s: 

0.22360500E 

05 

Liu!  .'000100  0011000100 
OOOOOOIOOCOOOOOO 

1100011011 

100001001 

Si 

0.2236C5CCE 

05 

0011000100  001  1  COO  ICC 
00C0001000000000 

1100110011 

100000001 

Si 

0.22360500E 

05 

001  1000100  0011000100 
0000001000000000 

1  1001  non 

10000100! 

Si 

0.2236C50CE 

05 

N;  96  R  t  0 


**PF<49):  0011001000 


COST:  0 ,3 5259985E  03 

CPU: 

12? 

.972 

ooi  looioro  coi  iooiooo 

OCOOOOloOOOOOOOO 

01000 

1C01 

1  00000C001 

s: 

0.1  756050CE 

05 

0011001000  OCl  1001000 
OOOOOOIOOCOOOOOO 

01001 1001 

1  0^0000001 

S: 

0 , 1  7560500E 

05 

001 1001000  001  1001000 
0000001000000000 

1  1000 

1001 

1  100000001 

S: 

0.1  75605C0E 

05 

0011  jOIOOO  COI  1CC1C00 
OOOOOCICCOOOOCOO 

N:  96  P:  C 

1  ICO  1 

IOC  1 

1  10C000001 

S : 

0.1  75605CCE 

05 
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**PF(50)s  001 10C1010 


COSTS  0.42219995E  03 
0011001010  0011001010 
0000001000000000 
0011001010  0011001010 
0000001000000000 
0011001010  0011001010 
0000001000000000 
C01 1001010  0011001010 
COOOOO 1000000000 
NS  64  Rs  0 

**PF(5! ):  CO  1 1 00 1 1 00 


**PF(52):  0011010000 

COST:  0 .3 741997  IE  03 
0011010000  0011010000 
0000001000000000 
0011010000  0011010000 
0000001000000000 
OOllOiOOOO  0011010000 
0000001000000000 
001 1010000  0011010000 
0000001000000000 
N:  96  Rt  0 

**PF(53) s  00 1 1 0 1 00 10 

COST:  0.44379980E  03 
001  1010010  0011010010 
COOOOO 1000000000 
0011010010  0011010010 
0000003000000000 
00110100.0  0011010010 
0000001000000000 
0011010010  001 1010010 
0000001000000000 
Ms  64  R:  0 


CPU:  131.965 
0100010001  000000000 

0100110001  000000000 

110001000!  100000000 

1100110001  100000000 


CPU:  136.840 
010000101!  000000001 

0100101011  000000001 

1 1000010 l 1  100000001 

1 1 00 1 01 0 i 1  100000001 


S:  0.1 7920500E  05 
S:  0. ) 7920500E  05 
S:  0,1 79°0500E  05 
S:  0.1 7920500E  05 


S:  0.25960500E  05 
S:  0.259 60 500E  05 
S:  0.25960500E  05 
S:  0.259 60500E  05 


S:  0 . 1 7560 500E  05 
S:  0 . 1 7560500E  05 
S:  0.1  7560  500k.  05 
S:  0, 17560500E  05 


S:  0. 1 7920500E  05 
S:  0.17920500E  05 
S:  0.1  ('920?00E  05 
S:  0 . 1 79C0500E  05 


CPU:  138.822 
0100001001  000000000 

0100101001  000000000 

1100001001  100000000 

1100101001  100000000 


COST:  0.43299976E  03 
0011001100  0011001100 
0000001000000000 
0011001100  001 1001100 
0000001000000000 
0011001100  0C1 1001 100 
0000001000000000 
0011001100  001 1001100 
0000001000000000 
Ms  64  P:  0 


PPHf  IXX  epl 
010001001  1  000000001 

0100110011  000000001 

1100010011  100000001 

1100110011  10000000! 
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**FF(54):  0011010100 

COST:  0.46659985E  03 
0011010100  001 1010100 
0000001000000000 
0011010100  0011010100 
0000001000000000 
0011010100  0011010100 
0000001000000000 
0011010100  00 1 1 0 1 0 1 oc 
0000001000000000 
0011010100  0011010100 
0000001000000000 
0011010100  OC 1 1 0 10 1 00 
0000001000000000 
0011010100  0011010100 
0000001000000000 
0011010100  0011010100 
0000001000000000 
N:  64  R:  0 

**PF(55):  0100000000 

COST:  0 .3  1259985E  03 
0100000000  C10000000C 
0000001000000000 
0100000000  0100000000 
0000001000000000 
0100000000  0100000000 
0000001000000000 
0100000000  0100000000 
0C0000100000C000 
0100000000  0100000000 
0000001000000000 
0100000000  0100000000 
00000010C00C0000 
0100000000  0100000000 
0000001000000000 
0100000000  0100000000 
0000001000000000 
0100000000  0100000000 
0000001000000000 
010000000C  OlOOOOOOOO 
0000001000000000 
OlOOOOOOOO  OlOOOOOOOO 
OOOOOOIOCOOOOOCO 
OlOOOOOOOO  OlOOOOOOOO 
0000001000000000 
N:  iso  P:  o 


CPU:  140.818 
0100000011  000000001 

0100001011  000000001 

01C010C011  000000001 

0100101011  000000001 

1100000011  100000001 

1 100001 C 1 1  100000001 

1100100011  100C00001 

1  1 00 1 0 1 C 1 1  100000001 

CPU:  145.736 
000 1101111  0001 00111 

0001111011  0001)1001 

OOllOOiill  001000111 

0011011011  001001001 

0014101111  001100111 

0011111011  001 101001 

1001101 1 1 1  00010011 1 

1001111011  000101001 

1011001111  001000111 

1011011011  001001001 

101 1 1C1 1 1 1  001100111 

1011111011  001 1C1001 


S:  0.259 60500E  05 
S:  0.259 60500E  05 
S:  0.259S0500E  05 
S:  0.25960500E  05 
S:  0.25S 60  500E  05 
S:  0,259 60500E  05 
S:  0 .259 6C500E  05 
S:  0.25960500E  05 


S:  0. 13360500E.  05 
S:  0.133 6050CE  05 
S:  0 . 1 3360500E  05 
S :  0 . 1 3360500E  05 
S:  C  .  1 3360500E  05 
S:  0. 1 J360500E  05 
S:  0. 133605CCE  05 
S:  0 . 1 3360500E  05 
S:  0. 133SC500E  05 
S:  0 . 1 3360500E  05 
S:  0 . 133S0500F  0? 
S?  0,133 30500E  05 
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**PF(56):  0100000010 


COST:  0.382 1997 IE  03 
0100000010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000C10  0100000010 
0000001000000000 
0100000010  0100000010 
000000 1000000000 
0100000010  0100000010 
0000001000000000 
010000C010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000010  0100000010 
0000001000000000 
0100000010  01000C0010 
0000001000000000 
N:  128  R:  o 

*+PF( 57) j  0100000100 

COST:  0.38699976E  03 
0100000100  0101  COO  1  CO 
0000001000000000 
0100000100  0101000100 
0000001000000000 
N:  128  F:  0 

**PF(  58):  0100001000 

COST:  0 .3305998 5E  03 
0100001000  0101001000 
0000001000000000 
0100001000  C 1 0 1 00 1 000 
OOOOOOIOOOOOOCCO 
N:  1 44  P:  0 


CPU:  149.804 
0001101101  000100100 

0001111001  000101000 

0011001101  001000100 

0011011001  001001000 

0011101101  001100100 

0011111001  CO l 1 0 1 000 

1001101101  000100100 

1001111001  000101000 

1011001101  001000100 

1011011001  00100100C 

1011101101  001100100 

1011111001  001101000 

CPU:  153.735 
00101  1  101  1  ..00000100) 

1010111011  OOOOOiOOl 

CPU:  157.^69 
0010110011  000000001 


C:  0. 13720500E  05 
S:  0. 1 3720500E  05 
S:  0.13720500E  05 
S:  0.13720500E  05 
S:  0 •  1 3720500E  05 
S:  0.13720500E  05 
S:  0 . 1 3720500E  05 
S:  0. 1 3720500E  05 
S:  0 . 1 3720500E  05 
S:  0, 13720500E  05 
S:  0. | 3720500E  05 
S:  0.13720500E  05 


S:  0.20960500E  05 
C:  0 ,209 6050CE  05 


S:  0,161 60500E  05 
S:  0.161 60500E  05 


1010110011  000000001 
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**PF(59):  0100001010 

COST:  C . 4C0 1 9995E  03  CPU:  ISO. 807 

C 1 0000 1 C 1 0  0101001010  00 1 C 1 1000 1  oroocoooc  S:  C.165205C0E  C5 
000000 ICOCOOOOOO 

0100001010  0101001010  ICiOllDCOl  OOOOCOOOC  S:  C.1652C500E  05 
0000001 OOOOOOOOC 
Vi  96  R:  C 

**PF(60):  0 1 OOOC 1 100 

COST:  C  .4 1 099976E  03  CPU:  163,644 

01000C1100  0101001100  0010110011  000000001  S:  0.245605C0E  05 
0000001000000000 

0100001100  0101001100  1010110011  000000001  S:  0.2456050CE  05 
0000001000000000 
N:  96  P:  0 

**PF(61 ):  0100010000 

COST:  0 .352 1 997 1 E  03  CPU:  167.862 

0100010000  0 1 0 1 C 1 0000  0010101011  C00000001  S:  0,161 6C50CF  C5 
OCOOOOIOOCOOOOOO 

CIOCOIOCOO  0101010000  101010101  1  000000001  S:  0,16160 50CE  05 

0000001000000000 

Nt  144  P:  0 

**PF( 62 ) :  0100010010 

COST:  0.421 79980E  03  CPU:  170.689 

0100010010  0101010010  0010101001  OOOOOOOOC  S:  0 . 1 S520500E  05 
00000C1000000000 

OlOOOlOulO  OlO.OiOClC  !O«Cj0n01  000000000  S:  0 . 1 6520500E  05 
0000001000000000 
N:  96  P:  0 

**PF(63) :  0100010100 

COST:  0.44459985E  03  CPU:  173,470 

0100010100  0101010100  00101 0001 1  000000001  S:  0.2456C50CE  05 
OOOOCC 1000000000 

0U0010100  0101010100  0010101011  OOUOOOOOI  S:  0.2  4560 500E  05 
OOOOOOIOOOOCOOOO 

0 1 000  1 C 1 00  0 1 C 1 0 1 0 1 00  10101 000 1 1  000C00001  S:  0.245605CCF  05 
OOCCOOIOOOOOOOOO 

0<0001C100  0 1 0 1 C ! 0 1 00  10 10 10 10  1 1  C00000C01  ? :  0 .2^560 5CCF  05 

OOOOOOIOOCCOOCOO 

F:  S6  P:  0 
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**PF(64) 


0  IOC 1G0000 


COST:  0.31S59985E  03 
CICOICOOOC  C1C11COOOO 
COOOOOi 000000000 
0100100000  0101100000 
COOOOOICOOOOOOCO 
OlOOICCCOO  01C11000CC 
0000001000000000 
C100100000  ClOllOCOOO 
OOOOCCIOOOOCOCOC 
M*  128  P:  0 


CPU: 

177. 

CO  1000 1 1  1  1 

0010:1 

101 1 

101000 

mi 

101001 

1011 

000000111 
000001001 
COOOOOI 11 
000001001 


S:  0.161 
S:  0.161 
S:  0.161 
S:  0.161 


6C500E  05 
60500E  05 
60 5f  E  05 
60500E  05 


**PF(65):  C 1 C 0 1 0 00 1 0 


COST:  0.3P819971E  03 
01001C0010  01011 OOC  1 0 
OOOOOOIOOOOCOOCO 
01 001  COO  10  0101 100010 
OOCOOOIOOOOOOOOO 
0100100010  0101 10001C 
COOOCOIOOOOOOCCO 
0100100010  C 1C  1 1  GOO  1 0 
COOOOOI 000000000 
N:  96  P :  0 


CPU:  180. 
•00100C1  101 

124 

OCOOOOIOO 

S: 

0. 1 6520500E 

05 

00100! 1001 

C00001000 

S: 

0 . 1 6520500E 

05 

1010001 101 

OCOOOOIOO 

S: 

0. 1 6520500E 

05 

101001 1001 

000001000 

S: 

0.1 6520500E 

05 

**PF(56):  0100100100 


COST:  0.41099976E  03 
01001C0100  0101100100 
OOOOCOIOOOOOOOOO 
0  IOC  100 100  0101100100 
OOCOOOIOOOOOOOOO 
0100100100  C101 100100 
OOOOOOICOOCOOOOO 
0  1 CC 1 00 1 00  0101100100 
000000 1000000000 
N:  96  P:  0 


CPU: 

182. 

00 1001001 1 

0010C1 

10'  1 

101001 

0011 

101001 

101  1 

29 

000000001 

C00001001 

000C00001 

000001001 


S:  0.24560500E  05 
S:  0.2456C500E  05 
S:  0.24560500E  05 
S:  0 . 24560 500E  05 


**PF(67):  0100101000 


COST:  0.354599P5E  03 
0100101000  0101 101000 
OOCOOOIOOOOOOOOO 
010010  1 OCO  0101101000 
OOOOOOIOOOOOCCOO 
$6  F:  0 


CPU:  185,793 
0010010011  000000001 

1010010011  000000001 


S:  0.19760500E  05 
S:  0 . 1 9760500E  05 
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**PF(68):  0 100 1 0 1 0 i 0 


COST:  0 ,424 19995E  03  CPU:  187.687 

0100101010  0101 1 0 1 0 1 C  0010010001  OOOOOCOOO  S:  0.201 2050CE  05 
OCCOOO 1000000000 

0100101010  0101 101010  1010010001  OOCOOOOOO  S:  0.201 2C500E  05 
0000001000000000 
N:  64  P:  0 

**PF(69):  0100101  ICO 

COST:  0.43499976E  03  CPU:  189.593 

0100101  100  Old  101 100  00 1  CO  1 001 1  OOOOOOOCl  S:  0.28160500E  05 
OOOOCOIOOOOOOOOO 

0100101 100  Old  101 100  101001001,  OOOOOOOCl  S:  0.281605C0E  05 
0000001000000000 
N :  64  F  :  0 

**PF( 70) :  01001 10C00 

COST:  0.37619S71E  03  CPU:  192,402 

0100110000  0 1 0  1 1  10000  0010001011  CC0000001  S:  0. 1976050CE  05 
OOOCOOIOOOOOOOOO 

0100110000  0101110000  1010001011  OOOOOCOOl  S :  0.1S760500E  05 
0000001000000000 
N*  S6  P:  0 

**PF( 7 1 ) :  01001 1 00  1C 

COST:  0.44579980E  03  CPU:  194.287 

OlOdlOOlO  0101110010  0010001001  OOOCOOOOC  S:  0,201 2C500E  05 

0000001000000000  X 

0100110010  0 1 C 1 1 10010  1 0  1000 1  CO  1  OOOOJOOOO  S:  0,201 2050CE  05 

0000001000000000 

N:  64  P;  0 

**PF(  72 ):  C 1 00 1  1 0  1 OC 

COST:  0.46859985E  03  CPU:  196.172 

0100110100  01011  10 ICC  0010000011  000000001  S:  0.281 6C5C0E  0 5 
OOOOOCICOOOOOOOO 

0100110100  0101110100  0010001011  OOOOOOOCl  S:  0.281 60500E  05 
0000001000000000 

0100110100  0101110100  1010000011  000000001  S:  0.2816C500E  05 
OOOOOOIOOCOOOOOO 

01001 10100  0101  1  10100  1 0 1 000 1011  0C0000001  S:  0.281605C0E  05 

OOOOCOIOOOOOOOOO 

N:  64  P:  0 


**PF (73) 


OiOlOOOOOO 


COST:  0.32259985E  03 

CPU:  199. 

5  78 

0101C00C00  0101000000 

00000011 1 1 

000000111 

S: 

0. 13960500E 

05 

0000001000000000 
0101000000  0101000000 

000001 1011 

000001001 

S: 

0.1 3S60500E 

05 

0000001000000000 
01C1000000  OlOICCOOCO 

0000101 1 1 1 

00000011 1 

S: 

0.13S60500E 

05 

0000001000000000 
0101000000  0101000000 

C00011 101 1 

000001001 

S: 

0. 13960 500E 

05 

00C0C01 000000000 
0101000000  0101000000 

00100011 1 1 

000000111 

S: 

0.13960500E 

05 

0000001000000000 
0101000000  0101000000 

001001 1011 

000001001 

S: 

0. 13960 500E 

05 

0000001000000000 
0101000000  0101000000 

0010101111 

00000011  1 

S: 

0 . 1 3960500E 

05 

0000001000000000 
0101000000  OiOlOOOOOO 

001011  101 1 

000001001 

S: 

0 . 1 3960500E 

05 

0000001000000000 
0101000000  OiOlOOOOOO 

1000001111 

000000111 

S: 

0.13960500E 

05 

0000001000000000 
OiOlOOOOOO  OiOlOOOOOO 

1000011011 

000001001 

S: 

0 . 1 3960500E 

05 

0000001000000000 
OiOlOOOOOO  OiOlOOOOOO 

100010)1 1 1 

000000111 

S: 

0. 13960500E 

05 

COOOOOICOOOCOOOO 
OiOlOOOOOO  OiOlOOOOOO 

100011 1011 

000001001 

S: 

0. 13960 500E 

05 

OOCOOOIOOOOOOOOO 
OiOlOOOOOO  OiOlOOOOOO 

10100011 1 1 

00000011 1 

S: 

0, 13960500E 

05 

0000001000000000 
OiOlOOOOOO  OiOlOOOOOO 

101001  1011 

000001001 

S: 

0. 13960 500E 

05 

OOCOOOIOOOOOOOOO 
OiOlOOOOOO  OiOlOOOOOO 

10101011 1 1 

0C00C01 1 1 

S: 

0. 1 3960500E 

05 

OCOOOOIOOOOOOOOO 
OiOlOOOOOO  OiOlOOOOOO 

10 10  1 1101 1 

000001001 

S: 

0.13960 500E 

05 

OOOOOOIOOOOO^OOO 
N:  128  Ps  0 


4  55 


**PF( 74) 


0101000010 


COST:  C.3S2  19S71  E  03 

CPU:  203  . 

140 

0105000010  010100C010 
0000001000000000 

0000001 10 1 

OOOOOOlOO 

S: 

0 . 1 4320500E 

05 

0101000010  0101000010 
000000 1000000000 

C00001 1001 

000001000 

S: 

0 . 1 4320500E 

05 

0101000010  0101000010 
OOOCOOIOOOOOOOOO 

OOOOiCl 101 

OOOOOOlOO 

S: 

0 . 1 4320500E 

05 

OiOIOOOOlO  0101000010 
000000 10C0000000 

0000111001 

000001000 

S: 

0 . 1 4320500E 

05 

0101000010  0101C00010 
OOOCOOIOOOOOOOOO 

0010001101 

OCOOOOIOO 

S: 

0 . 1  43 20 50 OE 

05 

0-101000010  0101000010 
000000 1000000000 

0010011001 

000001000 

S : 

0. 1 43205CQE 

05 

0101000010  0101000010 
0000001000000000 

001010110! 

OOOOOOlOO 

S: 

0. 143205C0E 

C5 

0101000010  0101000010 
OOOCOOIOOOOOOOOO 

001011 >001 

000001000 

S: 

0.1 4320500E 

05 

0101000010  0101000010 
0000001000000000 

100000 ! 101 

OOOOOOlOO 

S: 

0 . 1 4320500E 

05 

0101000010  0101000010 
0000001000000000 

1000011001 

000001000 

S : 

0 . 1 4320500E 

05 

0101000010  0101000010 
OOOOOOIOOOOOOCOO 

1000101101 

OOOOOOlOO 

S: 

0 . 1 432O500E 

C  5 

C 1 0  1 0000 1 0  0 1 0 1 0000  1C 
OOOOOOIOOOOOOCOO 

1000111001 

000001000 

s : 

0 . 1 4320500E 

05 

0101000010  0101000010 
0000001000C00000 

1010001101 

ooooooioo 

S: 

0.1 4320500E 

Of- 

0101000010  0101000010 
0000001000000000 

1010011001 

000001000 

S : 

0 . 1 4320500E 

Of 

0101000010  0101000010 
0000001000000000 

101010110! 

OOOOOOlOO 

S: 

0  e  1  4320 vOOE 

05 

0101000010  0101000010 
OOOOOOlOOCOOOCr'O 

N:  96  P:  0 

1010111001 

000001000 

S: 

0 . 1 4320500E 

05 

**PF( 75) 


0301000100 


COST:  0.41 499976E  03 
01010001C0  0101000100 
0000001000000000 
0101000100  0101000100 
OOOOOOIOOOCCCOOO 

omioooioo  oioioo'  100 

OOOCOO 10000000C0 
0101000100  0101000100 
0000001000000000 
0101000100  0101000100 
000000 10000C0000 
0101000100  0101 OCO IOC 
0000001000000000 
0101000100  0101000100 
0000001000000000 
oioioooioe  cioiocoioo 
0000001 OOOC 00000 
0101000100  *11 01000 100 
0000001000000000 
oioioooioo  omoooioo 
oocoonioooooccoo 

0101000100  0 1 C  1000100 
000000  100C000C 00 
0 1  C 1 000  1  00  OKI 000  1 00 
ooooooiooooorooo 
0101000100  r  101000100 
0000001000000000 
OIOIOOOIOO  OIOIOOOIOO 
0000001000000000 
OIOIOOOIOO  OIOIOOOIOO 
OOOOOOIOOOOOOOCO 
C 1 0 1 OCO 1 OC  OIOIOOOIOO 
000000)000000000 
N:  96  ft:  0 


CPU:  206.257 
0000010011  C00000001 

0000011011  OOOOO'OOl 

000011001  I  0C000C001 

0000111011  000001001 

0010010011  000000001 

OC 100 1 1 C 1 1  00C001C01 

ooioiioon  oeocooooi 
ooioinrn  ooocoiooi 
1000010011  000000001 
1  OOOC  1  1 0  i  1  000.001001 
1000110011  000000001 
10001 !  1011  000001001 
101.001001  1  000000001 
101001 101 1  000001001 
10101 1001  1  000000001 
10  10 1 1  1C  1  1  00C001001 


S: 

0.2236050CE 

05 

S: 

0.223 60 5C0E 

05 

St 

0.2236050CE 

05 

?: 

0  223605CCE 

05 

S: 

0 .2236)5')CF 

05 

S: 

0.2236C500E 

05 

S : 

0 .223605*  CE 

05 

S : 

0.2236C50CE 

05 

s : 

0.22360500E 

05 

S: 

0,223  60  iOOE 

05 

S : 

0.2236050CE 

05 

S: 

0.223  60  5C0E 

05 

S : 

0.22360500E 

05 

S: 

0.22360500E 

05 

S: 

0.2236C500E 

05 

S: 

0.22360500E 

05 

457 


**PF(76):  C101CC100C 


COST:  0.35859985E  03  CPI':  209.414 


0101001000  0101001000 
0000001000000000 

00000  10011 

000CCC001 

S: 

0. 1  756C500E 

05 

0101001000  0101001000 
C000001000000000 

0000110011 

onooooooi 

Si 

0. J756C500E 

05 

0101001000  0101001000 
0000001 ooooonooo 

0010010011 

000000001 

S: 

0.1 7560 5CCE 

05 

0101001000  0101001000 
0000001000000000 

0010110011 

000000001 

S: 

0.1  7560 500E 

05 

0101001000  010I0010C0 
OOOOCO 1000000000 

1000010011 

000000C01 

S: 

0.1  756C500E 

05 

o io iooi ou :  'iioiooiooo 
0000001000000000 

1000110011 

000000001 

c  • 

k»  • 

0. 1  756C5CCE 

05 

0101001000  0101001000 
0000001000000000 

1010010011 

0C0000CC1 

c  • 

• 

0.1  75605C0E 

05 

0101001000  0101001000 
000000  lOCOOOOOOO 

1010110011 

0000C0001 

c  • 

k.  • 

0 . 1  756050CE 

05 

N:  96  Rs  0 


**PF(77>:  0101001010 


COST:  0 .42?  1 999 5E  03 

CPU:  211. 

48  5 

0101001010  0101001010 
0000001000000000 

0000010001 

000000000 

c,  • 

w  t 

0,1 7S205C0E 

05 

0101001010  01010C1010 
OOOOOOIOOOCOOOOO 

00001 100C1 

000000000 

• 

k'  • 

0.1 792C50CE 

05 

0101001010  0101001010 
00C00010000C0000 

001001C001 

oooooocoo 

s: 

0.1792C5CCE 

05 

0101001010  r io ioo i o i c 
0000001000000000 

ocioi loooi 

000000000 

S: 

0.17S2C500E 

05 

C 1 0 1 00 1 0 1 C  0 1 0 1 00 1C  1 0 
000000100C00C000 

1000010001 

oocoooooc 

S: 

C.17920500E 

05 

0101001010  C 1 0 1 00 1 0  1 0 
00C0001 000000000 

1000  1 100C1 

oooccoooo 

S: 

0.1792050CE 

05 

0101001010  0101001010 
OOOOOOIOOOOCCOOO 

1  C  3  CO  1000 1 

ooooooocc 

Sf 

0.1  7920500E 

05 

0101001010  C1010C1C1C 
OOOOOOIOCOOOOOCO 

M:  64  P :  0 

1010110001 

ooooooooc 

S: 

0.1  792C5CCE 

05 

458 


**PF(7f ) 


0101001100 


COST:  0.43899976E 
0101001100  0101001 
000000 1000000000 
0101001100  0101001 
0000001000000000 
0101001100  0101001 
0000001000000000 
0101001100  0101001 
0000001000000000 
0101001100  0101001 
0000001000000000 
0101001 iOO  0101001 
000000 1000000000 
OlCiOOllOO  0101001 
0000001000000000 
0101001 100  0101001 
0000001000000000 
N:  64  p  :  c 


03 

100 

CPU:  213. 
0000010011 

524 

000000001 

S: 

C.259605C0E 

05 

100 

0000110011 

OCOOOOOOl 

3 : 

0.25S6050CE 

05 

100 

0010010011 

000000001 

S: 

0.259 605C0E 

05 

100 

001011001 1 

000000001 

s: 

0.259605C0E 

05 

100 

100001001 1 

000000001 

S: 

0.25950500E 

05 

:oo 

100011001 1 

000000001 

S: 

C.25S6C500E 

05 

100 

10  1001C01 1 

000000001 

S: 

0.25960500E 

05 

100 

101011001  1 

000000001 

St 

0.25960500E 

C5 

**PF( 79) :  0101010000 


COST:  0.38019971 

E  03 

CPU;  216. 

473 

0101010000  0101C 
0000001000000000 

10000 

0000001011 

000000001 

S: 

0 . 1 75605CCE 

05 

0101010000  01010 
0000001000000000 

10000 

000CI010 1 1 

000000001 

S: 

0 . 1 75605C0E 

05 

0101010000  01010 
OOOOOOIOOOOOOCCO 

10000 

r  ,000101 1 

OCOOOOOOl 

S: 

0. 1 756C500E 

05 

0101010000  01010 
0000001000000000 

100C0 

001010101 1 

000000001 

S: 

0.1 7560 5C0E 

05 

0101010000  01010 
COCOOOIOOOOOOOOO 

100C0 

100000101 1 

000000001 

S: 

0 . 1 7560500E 

05 

0101010000  01010 
OOOOOOIOOOOOOCCO 

100C0 

100010101 l 

000000001 

c  • 

»•  • 

0 . 1  7560500E 

05 

0101010000  0101C 
0000001000000000 

1C000 

1 0 1  COO  1 C  1  1 

000000001 

S: 

Cel  7 560 5 OOF 

05 

0101010000  01010 
0000001000000000 
N:  96  P:  0 

10000 

io  ic  ic  ic  1 1 

000000001 

S: 

0.1 7560^008 

05 

459 


**PF(00):  0101010010 

COST:  0 .44979980E  03 

CPU:  218. 

531 

0 1 0 1 0 ! 00 1 0  0 10 10 10010 
OOOCOOlOOOOOOOOO 

0000001001 

000000000 

S: 

0. 1 79205C0E 

05 

OiOiOiOOlO  oioioioo 
0000001000000000 

10 

0000101001 

000000000 

S: 

0.17S2C50CE 

05 

0101010010  01010100 
OOOCOOlOOOOOOOOO 

10 

0010001001 

oooccoooo 

S: 

0.17920500E 

05 

0101010010  01010100 
0000001000000000 

1C 

oo i o  io i oo i 

oocoocooo 

S: 

0.17920500E 

05 

0101010010  01010100 
0000001000000000 

10 

1000001001 

000000000 

S: 

0.1 7920500E 

05 

0101010010  01010100 
000000100CC00000 

10 

1000101001 

000000000 

S: 

0.17920500E 

05 

0101010010  01010100 
0000001000000000 

10 

1010001001 

ooooocooo 

S: 

0.1  792050CE 

05 

0 1 0  1 0 1 00 1 0  01010100 
0000001000000000 

N:  64  P:  0 

10 

1010101001 

ooooocooo 

S: 

0.1792050CE 

05 

460 


iiv**'*  *“  V 


r 


**PF(81 ):  0101010100 


COST:  0.47259985E  03 

CPU:  220. 

589 

010 1010 100  0101010100 
000000 100C000000 

000000001 1 

000000001 

S: 

0.25960500E 

05 

01C1010100  0101010100 
000000 1000000000 

.000000101  1 

000000001 

S: 

0.259 60500E 

05 

0101010100  0101010100 
0000001000000000 

000010001 1 

00000000 l 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOCOOOIOOOOOOOOC 

0000101011 

000000C01 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOiOOOOOOOOO 

001000001 1 

000000001 

c . 

w  • 

0.259 60500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

001000101 l 

000000001 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

0010100011 

000000001 

S: 

0.259G0500E 

05 

0101010100  0 10  10 10 100 
OOOOOOIOOOOOOOOO 

0010101011 

000000001 

S: 

0.259 60 500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

1000000011 

000000001 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

100C00101 1 

00C000001 

S: 

0.259  60500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

1000100011 

000000001 

3: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

1000101011 

000000001 

S: 

0.259  60500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

101000001 1 

000000001 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

1010001011 

oooooooot 

S: 

0.259 60500E 

05 

0101010100  0)01010100 
OOOOOOIOOOOOOOOO 

101010001 ! 

000000001 

S: 

0.25960500E 

05 

0101010100  0101010100 
OOOOOOIOOOOOOOOO 

1010101011 

000000001 

S: 

0.25960500E 

05 

N’S  64  »:  0 

PF(28>  CONTAINS  BEST  SOLUTI  ON  (S' )  .  7232  SOLUTIONS  CONSIDERED. 

MEAN  COST  OVER  ALL  SOLUTIONS:  0.46321021E  03 

MAXIMUM  COST  ENCOUNTERED:  0.67499951F  03 

**********  SOLUTION  TERMINATED  ********** 
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